TRANSPORTERS AND ION CHANNELS

TECHNICAL FIELD

This invention relates to nucleic acid and amino acid sequences of transporters and ion channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport, neurological, muscle, immunological, and cell proliferative disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of transporters and ion channels.

BACKGROUND OF THE INVENTION

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells and organelles require transport proteins to import and export essential nutrients and metal ions including K⁺, NH₄⁺, Pi, SO₄²⁻, sugars, and vitamins, as well as various metabolic waste products. Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book, Academic Press, San Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in transport include carrier proteins, which bind to a specific solute and undergo a conformational change that translocates the bound solute across the membrane, and channel proteins, which form hydrophilic pores that allow specific solutes to diffuse through the membrane down an electrochemical solute gradient.

Carrier proteins which transport a single solute from one side of the membrane to the other are called uniporters. In contrast, coupled transporters link the transfer of one solute with the simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous Na⁺/K⁺ ATPase system. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have twelve putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of various thyroid pathologies because it is the molecular basis for radioiodide thyroid-imaging techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) Proc. Natl. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and placenta, and is implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate (Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506).
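The distinction drawn above between uniport, symport, and antiport can be expressed as a small classification rule. The following Python sketch is illustrative only; the names `Direction`, `SoluteMovement`, and `classify_carrier` are hypothetical and are not part of the disclosure.

```python
from dataclasses import dataclass
from enum import Enum


class Direction(Enum):
    INWARD = "into the cell"
    OUTWARD = "out of the cell"


@dataclass(frozen=True)
class SoluteMovement:
    solute: str
    direction: Direction


def classify_carrier(movements):
    """Classify a carrier by how many solutes it moves and in which directions."""
    if len(movements) == 1:
        return "uniport"
    if len(movements) == 2:
        first, second = movements
        # Same direction -> symport; opposite directions -> antiport.
        return "symport" if first.direction == second.direction else "antiport"
    raise ValueError("this sketch only handles one or two solutes")


# SGLT1 is described above as sodium-coupled: Na+ and glucose both move inward.
sglt1 = [SoluteMovement("Na+", Direction.INWARD),
         SoluteMovement("glucose", Direction.INWARD)]
print(classify_carrier(sglt1))
```

A carrier moving two solutes in opposite directions would be reported as an antiport by the same rule.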
One of the largest families of transporters is the major facilitator superfamily (MFS), also called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers that transport small solutes in response to ion gradients. Members of the MFS are found in all classes of living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates, nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1-34). The largest family of MFS transporters is the sugar transporter family, which includes the seven glucose transporters (GLUT1-GLUT7) found in humans that are required for the transport of glucose and other hexose sugars. These glucose transport proteins have unique tissue distributions and physiological functions. GLUT1 provides many cell types with their basal glucose requirements and transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake or efflux from the liver; GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake into skeletal muscle. Defects in glucose transporters are involved in a recently identified neurological syndrome causing infantile seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem. 219:713-725; Longo, N. and L.J. Elsas (1998) Adv. Pediatr. 45:293-313).

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop between TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the protons that are produced stoichiometrically with lactate during glycolysis. The best characterized H⁺-monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and a wide range of other aliphatic monocarboxylates. Other cells possess H⁺-linked monocarboxylate transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and tumor cells have transporters that differ in their Km values for certain substrates, including stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are Na⁺-monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are specific and selective transporters for organic cations and organic anions in organs including the kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute to the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol. 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and I. Haggstrom (1993) J. Biotechnol. 30:339-350).

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane proteins that transport substances ranging from small molecules such as ions, sugars, amino acids, peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the energy required for transport, and two membrane-spanning domains (MSD), each containing six putative transmembrane segments. These four modules may be encoded by a single gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form homo- and heterodimers, such as Tap1 and Tap2, the endoplasmic reticulum-based major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 292:130-162).
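The modular composition of ABC transporters described above (two NBDs plus two MSDs, supplied either by one gene or by two "half-molecules") can be checked with a simple bookkeeping sketch. The function names and the module ordering in `single_gene` are assumptions made for illustration; only the module counts come from the text.

```python
from collections import Counter

# Module composition of a complete ABC transporter, as stated in the text:
# two nucleotide-binding domains (NBD) and two membrane-spanning domains (MSD).
FULL_TRANSPORTER = Counter({"NBD": 2, "MSD": 2})
TM_SEGMENTS_PER_MSD = 6  # "each containing six putative transmembrane segments"


def assemble(*gene_products):
    """Sum the modules contributed by each gene product (a list of module names)."""
    total = Counter()
    for product in gene_products:
        total.update(product)
    return total


def is_complete(modules):
    """True when the assembled modules match the four-module composition."""
    return modules == FULL_TRANSPORTER


single_gene = ["MSD", "NBD", "MSD", "NBD"]  # hypothetical ordering for the one-gene case
half_molecule = ["MSD", "NBD"]              # one NBD plus one MSD per gene product

print(is_complete(assemble(single_gene)))                   # one gene, four modules
print(is_complete(assemble(half_molecule, half_molecule)))  # dimer of half-molecules
print(is_complete(assemble(half_molecule)))                 # a lone half-molecule
print(FULL_TRANSPORTER["MSD"] * TM_SEGMENTS_PER_MSD)        # total putative TM segments
```

Under these counts a lone half-molecule is incomplete, which is consistent with the requirement that half-molecules dimerize.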

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin), and lysyl oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and other target organs, where specific transporters move the ions into cells and cellular organelles as needed. Imbalances in metal ion metabolism have been associated with a number of disease states (Danks, D.M. (1986) J. Med. Genet. 23:99-106).

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, low affinity process. However, under normal physiological conditions a significant fraction of fatty acid transport appears to occur via a high affinity, low capacity protein-mediated transport process. Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane segments, is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et al. (1998) J. Biol. Chem. 273:27420-27429).

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the pyruvate carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the tricarboxylate carrier which transports citrate and malate; and the Graves' disease carrier protein, a protein recognized by IgG in patients with active Graves' disease, an autoimmune disorder resulting in hyperthyroidism. Proteins in this family consist of three tandem repeats of an approximately 100 amino acid domain, each of which contains two transmembrane regions (Stryer, L. (1995) Biochemistry, W.H. Freeman and Company, New York NY, p. 551; PROSITE PDOC00189 Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in Man (OMIM) *275000 Graves Disease).

This class of transporters also includes the mitochondrial uncoupling proteins, which create proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. (1999) J. Int. Med. 245:637-642).
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of ions across the plasma membrane. The movement of ions requires ion channels, which form ion-selective pores within the membrane. There are two basic types of ion channels, ion transporters and gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion.
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion transporters, including Na⁺-K⁺ ATPase, Ca²⁺-ATPase, and H⁺-ATPase, are activated by a phosphorylation event. P-class ion transporters are responsible for maintaining resting potential distributions such that cytosolic concentrations of Na⁺ and Ca²⁺ are low and cytosolic concentration of K⁺ is high. The vacuolar (V) class of ion transporters includes H⁺ pumps on intracellular organelles, such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH within the lumen of these organelles that is required for function. The coupling factor (F) class consists of H⁺ pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from ADP and inorganic phosphate (Pi).

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) Curr. Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the V₁ domain, a peripheral complex responsible for ATP hydrolysis; and the V₀ domain, an integral complex responsible for proton translocation across the membrane. The F-ATPases are structurally and evolutionarily related to the V-ATPases. The F-ATPase F₀ domain contains 12 copies of the c subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a single buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V₀ domain contains three types of homologous c subunits with four or five transmembrane domains and the essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. Biol. Chem. 274:12951-12954).

The resting potential of the cell is utilized in many processes involving carrier proteins and gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport (symport) so that the movement of Na⁺ down an electrochemical gradient drives transport of the other molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca²⁺ out of the cell with transport of Na⁺ into the cell (antiport).
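To make the energetics of Na⁺-coupled symport concrete, the sketch below estimates the largest concentration ratio that an uncharged solute could reach when its uptake is coupled to n sodium ions, using the standard equilibrium relation [S]in/[S]out = ([Na⁺]out/[Na⁺]in)ⁿ · exp(−nFV/RT). The concentrations, membrane potential, and temperature are assumed textbook-style values, not values taken from this disclosure, and the function name is hypothetical.

```python
import math

R = 8.314462618  # gas constant, J/(mol*K)
F = 96485.33212  # Faraday constant, C/mol


def max_accumulation_ratio(na_out_mm, na_in_mm, membrane_v, n_na=1, temp_k=310.15):
    """Equilibrium [S]in/[S]out for an uncharged solute co-transported with n Na+.

    membrane_v is the inside-minus-outside potential in volts (negative at rest).
    """
    chemical_term = (na_out_mm / na_in_mm) ** n_na
    electrical_term = math.exp(-n_na * F * membrane_v / (R * temp_k))
    return chemical_term * electrical_term


# Assumed values for illustration: 145 mM outside, 12 mM inside, -70 mV.
ratio_1 = max_accumulation_ratio(145.0, 12.0, -0.070, n_na=1)
ratio_2 = max_accumulation_ratio(145.0, 12.0, -0.070, n_na=2)
print(f"1 Na+ per solute: up to ~{ratio_1:.0f}-fold")
print(f"2 Na+ per solute: up to ~{ratio_2:.0f}-fold")
```

Under these assumptions, coupling to two sodium ions squares the achievable ratio, which illustrates why the sodium gradient is an effective driving force for uphill solute transport.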

Gated Ion Channels

Gated ion channels control ion flow by regulating the opening and closing of pores. The ability to control ion flux through various gating mechanisms allows ion channels to mediate such diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction, fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to the manner of regulating the gating function. Mechanically-gated channels open their pores in response to mechanical stress; voltage-gated channels (e.g., Na⁺, K⁺, Ca²⁺, and Cl⁻ channels) open their pores in response to changes in membrane potential; and ligand-gated channels (e.g., acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and closing) are sometimes modulated by association with auxiliary channel proteins and/or post translational modifications, such as phosphorylation.

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or stress on the cell membrane and conduct both Ca²⁺ and Na⁺ (Suzuki, M. et al. (1999) J. Biol. Chem. 274:6330-6335).

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion channel proteins. The characteristic domain of these channel proteins comprises six transmembrane domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and carboxy termini. In the Na⁺ and Ca²⁺ subfamilies, this domain is repeated four times, while in the K⁺ channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits. The P region contains information specifying the ion selectivity for the channel. In the case of K⁺ channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. Natl. Acad. Sci. USA 94:11651-11656).
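As an illustration of how the GYG tripeptide mentioned above might be located in an amino acid sequence, the following sketch scans a one-letter-code string for the motif. The function name and the example fragment are hypothetical; the fragment was invented for this example and is not a sequence from this disclosure.

```python
def find_motif(sequence, motif="GYG"):
    """Return the 0-based start position of every occurrence of motif in sequence."""
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start)
        # Advance by one so that overlapping occurrences are also reported.
        start = sequence.find(motif, start + 1)
    return positions


# Hypothetical P-region fragment, invented for illustration only.
p_region_fragment = "ASTVGYGDLA"
print(find_motif(p_region_fragment))
```

The presence of the motif alone would not establish K⁺ selectivity; a search of this kind is only a first-pass filter.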

Voltage-gated Na⁺ and K⁺ channels are necessary for the function of electrically excitable cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and muscle contraction, arise from large, transient changes in the permeability of the membrane to Na⁺ and K⁺ ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na⁺ channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more voltage-gated Na⁺ channels, which propagates the depolarization down the length of the cell. Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts only about 1 millisecond, at which time the channel spontaneously converts into an inactive state that cannot be opened irrespective of the membrane potential. Inactivation is mediated by the channel's N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a closed state requires a return to resting potential.
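The closed, open, and inactive states described above form a simple cycle that can be sketched as a state machine. In the Python sketch below, the threshold and resting potentials are assumed illustrative values, and the names `State` and `step` are hypothetical; only the order of transitions and the approximately 1 millisecond open time come from the text.

```python
from enum import Enum


class State(Enum):
    CLOSED = "closed"
    OPEN = "open"
    INACTIVE = "inactive"


THRESHOLD_MV = -55.0    # assumed threshold, for illustration only
RESTING_MV = -70.0      # assumed resting potential, for illustration only
OPEN_DURATION_MS = 1.0  # "about 1 millisecond", per the text


def step(state, membrane_mv, ms_in_state):
    """Return the next channel state given the potential and time spent in state."""
    if state is State.CLOSED and membrane_mv > THRESHOLD_MV:
        return State.OPEN      # depolarization beyond threshold opens the channel
    if state is State.OPEN and ms_in_state >= OPEN_DURATION_MS:
        return State.INACTIVE  # spontaneous inactivation (N-terminal plug)
    if state is State.INACTIVE and membrane_mv <= RESTING_MV:
        return State.CLOSED    # recovery requires a return to resting potential
    return state               # otherwise the state is unchanged


print(step(State.CLOSED, -40.0, 0.0))
print(step(State.OPEN, -40.0, 1.0))
print(step(State.INACTIVE, 20.0, 5.0))
print(step(State.INACTIVE, -70.0, 5.0))
```

Note that in this sketch an inactive channel stays inactive at any depolarized potential, mirroring the statement that it cannot be opened irrespective of the membrane potential.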

Voltage-gated Na⁺ channels are heterotrimeric complexes composed of a 260 kDa pore-forming α subunit that associates with two smaller auxiliary subunits, β1 and β2. The β2 subunit is an integral membrane glycoprotein that contains an extracellular Ig domain, and its association with α and β1 subunits correlates with increased functional expression of the channel, a change in its gating properties, as well as an increase in whole cell capacitance due to an increase in membrane surface area (Isom, L.L. et al. (1995) Cell 83:433-442).
Non voltage-gated Na⁺ channels include the members of the amiloride-sensitive Na⁺ channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini located within the cell. The NaC/DEG family includes the epithelial Na⁺ channel (ENaC) involved in Na⁺ reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the kidney, and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently characterized H⁺-gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are expressed in the brain and form heteromultimeric Na⁺-permeable channels. These channels require acid pH fluctuations for activation. ASIC subunits show homology to the degenerins, a family of mechanically-gated channels originally isolated from C. elegans. Mutations in the degenerins cause neurodegeneration. ASIC subunits may also have a role in neuronal function, or in pain perception, since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Opin. Neurobiol. 8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337-342).

K⁺ channels are located in all cell types, and may be regulated by voltage, ATP concentration, or second messengers such as Ca²⁺ and cAMP. In non-excitable tissue, K⁺ channels are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action potentials and repolarizing membranes, K⁺ channels are responsible for setting resting membrane potential. The cytosol contains non-diffusible anions and, to balance this net negative charge, the cell contains a Na⁺-K⁺ pump and ion channels that provide the redistribution of Na⁺, K⁺, and Cl⁻. The pump actively transports Na⁺ out of the cell and K⁺ into the cell in a 3:2 ratio. Ion channels in the plasma membrane allow K⁺ and Cl⁻ to flow by passive diffusion. Because of the high negative charge within the cytosol, Cl⁻ flows out of the cell. The flow of K⁺ is balanced by an electromotive force pulling K⁺ into the cell, and a K⁺ concentration gradient pushing K⁺ out of the cell. Thus, the resting membrane potential is primarily regulated by K⁺ flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489-492).
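The balance between the electrical force and the concentration gradient described above is conventionally quantified by the Nernst equation, E = (RT/zF) ln([ion]out/[ion]in). The sketch below evaluates it for K⁺ and Na⁺ using assumed textbook-style concentrations at 37 °C; these numbers and the function name `nernst_mv` are illustrative assumptions and do not come from this disclosure.

```python
import math

R = 8.314462618  # gas constant, J/(mol*K)
F = 96485.33212  # Faraday constant, C/mol


def nernst_mv(conc_out_mm, conc_in_mm, valence, temp_k=310.15):
    """Equilibrium (Nernst) potential in millivolts, inside relative to outside."""
    return 1000.0 * (R * temp_k) / (valence * F) * math.log(conc_out_mm / conc_in_mm)


# Assumed concentrations (mM), for illustration only.
e_k = nernst_mv(conc_out_mm=5.0, conc_in_mm=140.0, valence=1)
e_na = nernst_mv(conc_out_mm=145.0, conc_in_mm=12.0, valence=1)
print(f"E_K  = {e_k:.1f} mV")
print(f"E_Na = {e_na:.1f} mV")
```

With these assumed values the K⁺ equilibrium potential is strongly negative (close to −89 mV) while that of Na⁺ is positive, so a membrane that is mainly permeable to K⁺ at rest sits near the K⁺ value, consistent with the statement that resting potential is primarily regulated by K⁺ flow.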

Potassium channel subunits of the Shaker-like superfamily all have the characteristic six transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to form functional K⁺ channels. These pore-forming subunits also associate with various cytoplasmic β subunits that alter channel inactivation kinetics. The Shaker-like channel family includes the voltage-gated K⁺ channels as well as the delayed rectifier type channels such as the human ether-a-go-go related gene (HERG) associated with long QT, a cardiac dysrhythmia syndrome (Curran, M.E. (1998) Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. Chem. Biol. 3:448-458).

A second superfamily of K⁺ channels is composed of the inward rectifying channels (Kir). Kir channels have the property of preferentially conducting K⁺ currents in the inward direction. These proteins consist of a single potassium selective pore domain and two transmembrane domains, which correspond to the fifth and sixth transmembrane domains of voltage-gated K⁺ channels. Kir subunits also associate as tetramers. The Kir family includes ROMK1, mutations in which lead to Bartter syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et al. (1995) Curr. Opin. Neurobiol. 5:268-277; Curran, supra).

The recently recognized TWIK K⁺ channel family includes the mammalian TWIK-1, TREK-1 and TASK proteins. Members of this family possess an overall structure with four transmembrane domains and two P domains. These proteins are probably involved in controlling the resting potential in a large set of cell types (Duprat, F. et al. (1997) EMBO J. 16:5464-5471).

The voltage-gated Ca²⁺ channels have been classified into several subtypes based upon their electrophysiological and pharmacological characteristics. L-type Ca²⁺ channels are predominantly expressed in heart and skeletal muscle where they play an essential role in excitation-contraction coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type channels are involved in the control of neurotransmitter release in the central and peripheral nervous system. The L-type and N-type voltage-gated Ca²⁺ channels have been purified and, though their functions differ dramatically, they have similar subunit compositions. The channels are composed of three subunits. The α1 subunit forms the membrane pore and voltage sensor, while the α2δ and β subunits modulate the voltage-dependence, gating properties, and the current amplitude of the channel. These subunits are encoded by at least six α1, one α2δ, and four β genes. A fourth subunit, γ, has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312).

The transient receptor family (Trp) of calcium ion channels are thought to mediate capacitative calcium entry (CCE). CCE is the Ca²⁺ influx into cells to resupply Ca²⁺ stores depleted by the action of inositol trisphosphate (IP3) and other agents in response to numerous hormones and growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to voltage gated Ca²⁺ channels in the S3 through S6 regions. This suggests that Trp and/or related proteins may form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) J. Biol. Chem. 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, and whose expression in melanoma cells is inversely correlated with [...]. The human cDNA transcript corresponds to a 1533-amino acid protein having homology to members of the Trp family. It has been proposed that the combined use of melastatin mRNA expression status and tumor thickness might allow for the determination of subgroups of patients at both low and high risk for developing metastatic disease (Duncan, L.M. et al. (2001) J. Clin. Oncol. 19:568-576).

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and organelle pH. In secretory epithelial cells, Cl⁻ enters the cell across a basolateral membrane through an Na⁺, K⁺/Cl⁻ cotransporter, accumulating in the cell above its electrochemical equilibrium concentration. Secretion of Cl⁻ from the apical surface, in response to hormonal stimulation, leads to flow of Na⁺ and water into the secretory lumen. The cystic fibrosis transmembrane conductance regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of two domains each consisting of six transmembrane domains followed by a nucleotide-binding site. Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and devastating "chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266).

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane domains, as well as two small globular domains known as CBS domains. The CLC subunits probably function as homotetramers. CLC proteins are involved in regulation of cell volume, membrane potential stabilization, signal transduction, and transepithelial transport. Mutations in CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney channel CLC-5 lead to kidney stones (Jentsch, T.J. (1996) Curr. Opin. Neurobiol. 6:303-310).

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter binds to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle cells. There are two types of neurotransmitter-gated channels. Sodium channels open in response to excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This opening causes an influx of Na⁺ and produces the initial localized depolarization that activates the voltage-gated channels and starts the action potential. Chloride channels open in response to inhibitory neurotransmitters, such as γ-aminobutyric acid (GABA) and glycine, leading to hyperpolarization of the membrane, which opposes the subsequent generation of an action potential. Neurotransmitter-gated ion channels have four transmembrane domains and probably function as pentamers (Jentsch, supra). Amino acids in the second transmembrane domain appear to be important in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr. Opin. [...]).

Ligand-gated channels can be regulated by intracellular second messengers. For example, calcium-activated K⁺ channels are gated by internal calcium ions. In nerve cells, an influx of calcium during depolarization opens K⁺ channels to modulate the magnitude of the action potential (Ishii et al., supra). The large conductance (BK) channel has been purified from brain and its subunit composition determined. The α subunit of the BK channel has seven rather than six transmembrane domains in contrast to voltage-gated K⁺ channels. The extra transmembrane domain is located at the subunit N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the "calcium bowl" region) contains many negatively charged residues and is thought to be the region responsible for calcium binding. The β subunit consists of two transmembrane domains connected by a glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra; Vergara, C. et al. (1998) Curr. Opin. Neurobiol. 8:321-329).

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best examples of these are the cAMP-gated Na⁺ channels involved in olfaction and the cGMP-gated cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein coupled receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also represent a major pathway for Ca²⁺ entry into neurons, and play roles in neuronal development and plasticity. CNG channels are tetramers containing at least two types of subunits, an α subunit which can form functional homomeric channels, and a β subunit, which modulates the channel properties. All CNG subunits have six transmembrane domains and a pore forming region between the fifth and sixth transmembrane domains, similar to voltage-gated K⁺ channels. A large C-terminal domain contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among channel subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412).

The activity of other types of ion channel proteins may also be modulated by a variety of intracellular signalling proteins. Many channels have sites for phosphorylation by one or more protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of which regulate ion channel activity in cells. Kir channels are activated by the binding of the Gβγ subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell. Biol. 11:503-508). Other proteins are involved in the localization of ion channels to specific sites in the cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs (membrane-associated guanylate kinases) which regulate the clustering of ion channels at neuronal synapses (Craven, S.E. and D.S. Bredt (1998) Cell 93:495-498).

Disease Correlation

The etiology of numerous human diseases and disorders can be attributed to defects in the
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters
and ion channels are associated with several disorders, e.g., cystic fibrosis, glucose-galactose
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across
membranes include, e.g., cystinuria, iminoglycinuria, Hartnup disease, and Fanconi disease (van't Hoff,
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226;
and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480).

Human diseases caused by mutations in ion channel genes include disorders of skeletal muscle,
cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of sodium and
chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary contraction is
delayed. Sodium channel myotonias have been treated with channel blockers. Mutations in muscle
sodium and calcium channels cause forms of periodic paralysis, while mutations in the sarcoplasmic
calcium release channel, T-tubule calcium channel, and muscle sodium channel cause malignant
hyperthermia. Cardiac arrhythmia disorders such as the long QT syndromes and idiopathic ventricular
fibrillation are caused by mutations in potassium and sodium channels (Cooper, E.C. and L.Y. Jan
(1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four known human idiopathic epilepsy genes
code for ion channel proteins (Berkovic, S.F. and I.E. Scheffer (1999) Curr. Opin. Neurology
12:177-182). Other neurological disorders such as ataxias, hemiplegic migraine, and hereditary deafness can
also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin. Neurobiol. 9:274-280;
Cooper, supra).

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia.
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma,
and neurodegenerative disease (Taylor, C.P. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98).
Various classes of ion channels also play an important role in the perception of pain, and thus are
potential targets for new analgesics. These include the vanilloid-gated ion channels, which are activated
by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as lidocaine and
mexiletine, which block voltage-gated Na+ channels, have been useful in the treatment of neuropathic
pain (Eglen, supra).

Ion channels in the immune system have recently been suggested as targets for
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell
specific ion channels has been characterized that affect this signaling process. Channel blocking agents
can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide antagonist
of the T-cell potassium channel Kv1.3 was found to suppress delayed-type hypersensitivity and
allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious
immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756).

The discovery of new transporters and ion channels, and the polynucleotides encoding them,
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention,
and treatment of transport, neurological, muscle, immunological, and cell proliferative disorders, and in
the assessment of the effects of exogenous compounds on the expression of nucleic acid and amino acid
sequences of transporters and ion channels.


SUMMARY OF THE INVENTION 
The invention features purified polypeptides, transporters and ion channels, referred to
collectively as "TRICH" and individually as "TRICH-1," "TRICH-2," "TRICH-3," "TRICH-4,"
"TRICH-5," "TRICH-6," "TRICH-7," "TRICH-8," "TRICH-9," "TRICH-10," "TRICH-11,"
"TRICH-12," "TRICH-13," "TRICH-14," "TRICH-15," "TRICH-16," "TRICH-17," "TRICH-18,"
"TRICH-19," "TRICH-20," "TRICH-21," "TRICH-22," "TRICH-23," "TRICH-24," "TRICH-25,"
"TRICH-26," "TRICH-27," "TRICH-28," "TRICH-29," "TRICH-30," "TRICH-31," and "TRICH-32."
In one aspect, the invention provides an isolated polypeptide selected from the group consisting of
a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical
to an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, c) a biologically
active fragment of a polypeptide having an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32. In one alternative, the invention provides an
isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-32.

The invention further provides an isolated polynucleotide encoding a polypeptide selected from
the group consisting of a) a polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid sequence
at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-32. In one alternative, the
polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID NO:1-32. In
another alternative, the polynucleotide is selected from the group consisting of SEQ ID NO:33-64.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group consisting
of a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical
to an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, c) a biologically
active fragment of a polypeptide having an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32. In one alternative, the invention provides a cell
transformed with the recombinant polynucleotide. In another alternative, the invention provides a
transgenic organism comprising the recombinant polynucleotide.

The invention also provides a method for producing a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90%
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, c) a
biologically active fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a polypeptide having an amino
acid sequence selected from the group consisting of SEQ ID NO:1-32. The method comprises a)
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally
occurring amino acid sequence at least 90% identical to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32.

The invention further provides an isolated polynucleotide selected from the group consisting of
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the
polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide
comprises at least 60 contiguous nucleotides.

Additionally, the invention provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting of
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to
said target polynucleotide, under conditions whereby a hybridization complex is formed between said
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe
comprises at least 60 contiguous nucleotides.

The invention further provides a method for detecting a target polynucleotide in a sample, said
target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b)
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and,
optionally, if present, the amount thereof.

The invention further provides a composition comprising an effective amount of a polypeptide
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical to an amino acid sequence selected from the group consisting of SEQ
ID NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and a
pharmaceutically acceptable excipient. In one embodiment, the composition comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:1-32. The invention additionally provides a
method of treating a disease or condition associated with decreased expression of functional TRICH,
comprising administering to a patient in need of such treatment the composition.

The invention also provides a method for screening a compound for effectiveness as an
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32. The method comprises a) exposing a sample comprising the polypeptide to a compound,
and b) detecting agonist activity in the sample. In one alternative, the invention provides a
composition comprising an agonist compound identified by the method and a pharmaceutically
acceptable excipient. In another alternative, the invention provides a method of treating a disease or
condition associated with decreased expression of functional TRICH, comprising administering to a
patient in need of such treatment the composition.

Additionally, the invention provides a method for screening a compound for effectiveness as
an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an
amino acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and
d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32. The method comprises a) exposing a sample comprising the
polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the
invention provides a composition comprising an antagonist compound identified by the method and a
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of
treating a disease or condition associated with overexpression of functional TRICH, comprising
administering to a patient in need of such treatment the composition.

The invention further provides a method of screening for a compound that specifically binds
to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32. The method comprises a) combining the polypeptide with at least one test compound under
suitable conditions, and b) detecting binding of the polypeptide to the test compound, thereby
identifying a compound that specifically binds to the polypeptide.

The invention further provides a method of screening for a compound that modulates the
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32. The method comprises a) combining the polypeptide with at least one test compound under
conditions permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide
in the presence of the test compound, and c) comparing the activity of the polypeptide in the presence
of the test compound with the activity of the polypeptide in the absence of the test compound,
wherein a change in the activity of the polypeptide in the presence of the test compound is indicative
of a compound that modulates the activity of the polypeptide.

The invention further provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
sequence selected from the group consisting of SEQ ID NO:33-64, the method comprising a)
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered
expression of the target polynucleotide.

The invention further provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound;
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions
whereby a specific hybridization complex is formed between said probe and a target polynucleotide in
the biological sample, said target polynucleotide selected from the group consisting of i) a
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:33-64, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a
polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide complementary to the
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide
comprises a fragment of a polynucleotide sequence selected from the group consisting of i)-v) above;
c) quantifying the amount of hybridization complex; and d) comparing the amount of hybridization
complex in the treated biological sample with the amount of hybridization complex in an untreated
biological sample, wherein a difference in the amount of hybridization complex in the treated
biological sample is indicative of toxicity of the test compound.

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
sequences of the present invention.

Table 2 shows the GenBank identification number and annotation of the nearest GenBank
homolog for polypeptides of the invention. The probability score for the match between each
polypeptide and its GenBank homolog is also shown.

Table 3 shows structural features of polypeptide sequences of the invention, including predicted
motifs and domains, along with the methods, algorithms, and searchable databases used for analysis of
the polypeptides.

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide
sequences.

Table 5 shows the representative cDNA library for polynucleotides of the invention.

Table 6 provides an appendix which describes the tissues and vectors used for construction of
the cDNA libraries shown in Table 5.

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters.

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood
that this invention is not limited to the particular machines, materials and methods described, as these
may vary. It is also to be understood that the terminology used herein is for the purpose of describing
particular embodiments only, and is not intended to limit the scope of the present invention which will
be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so
forth.

Unless defined otherwise, all technical and scientific terms used herein have the same meanings
as commonly understood by one of ordinary skill in the art. Although
any machines, materials, and methods similar or equivalent to those described herein can be used to
practice or test the present invention, the preferred machines, materials and methods are now described.
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines,
protocols, reagents and vectors which are reported in the publications and which might be used in
connection with the invention. Nothing herein is to be construed as an admission that the invention is
not entitled to antedate such disclosure by virtue of prior invention.
DEFINITIONS

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of
TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other
compound or composition which modulates the activity of TRICH either by directly interacting with
TRICH or by acting on components of the biological pathway in which TRICH participates.

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants may
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or
many allelic variants of its naturally occurring form. Common mutational changes which give rise to
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times in
a given sequence.

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions,
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a
polypeptide with at least one functional characteristic of TRICH. Included within this definition are
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of
the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, with
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding TRICH.
The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of
amino acid residues which produce a silent change and result in a functionally equivalent TRICH.
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge,
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the
biological or immunological activity of TRICH is retained. For example, negatively charged amino
acids may include aspartic acid and glutamic acid, and positively charged amino acids may include
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity
values may include: asparagine and glutamine; and serine and threonine. Amino acids with
uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and
valine; glycine and alanine; and phenylalanine and tyrosine.

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence
to the complete native amino acid sequence associated with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known
in the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of
TRICH. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small
molecules, or any other compound or composition which modulates the activity of TRICH either by
directly interacting with TRICH or by acting on components of the biological pathway in which TRICH
participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof,
such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies which
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to
elicit the immune response) for binding to an antibody.
The term "antisense" refers to any composition capable of base-pairing with the "sense"
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA;
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified
sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense
molecules may be produced by any method including chemical synthesis or transcription. Once
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring
nucleic acid sequence produced by the cell to form duplexes which block either transcription or
translation. The designation "negative" or "minus" can refer to the antisense strand, and the
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic"
refers to the capability of the natural, recombinant, or synthetic TRICH, or of any oligopeptide thereof,
to induce a specific immune response in appropriate animals or cells and to bind with specific
antibodies.

"Complementary" describes the relationship between two single-stranded nucleic acid
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement,
3'-TCA-5'.
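The base-pairing relationship in this definition can be illustrated with a short Python sketch. It is only an illustration under simple assumptions: DNA sequences written with the four unambiguous bases, and hypothetical helper names (`complement`, `reverse_complement`, `are_complementary`) that do not come from the specification.

```python
# Watson-Crick pairing for DNA; illustrative only, names are hypothetical.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Base-by-base complement of a 5'->3' strand.

    The returned string lines up under the input, so it reads 3'->5'.
    """
    return "".join(PAIRS[base] for base in seq.upper())


def reverse_complement(seq: str) -> str:
    """Complementary strand rewritten in the conventional 5'->3' direction."""
    return complement(seq)[::-1]


def are_complementary(strand_a: str, strand_b: str) -> bool:
    """True if two strands, both written 5'->3', pair over their full length."""
    return reverse_complement(strand_a) == strand_b.upper()


if __name__ == "__main__":
    # 5'-AGT-3' pairs with 3'-TCA-5', which is 5'-ACT-3' when read 5'->3'.
    print(complement("AGT"))
    print(reverse_complement("AGT"))
    print(are_complementary("AGT", "ACT"))
```

The same helper gives the antisense strand of a coding sequence when that sequence is passed to `reverse_complement`.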

A "composition comprising a given polynucleotide sequence" and a "composition comprising a
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding TRICH or fragments of TRICH may be
employed as hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate;
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosystems,
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for
fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI) or Phrap
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to
produce the consensus sequence.
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t . ... 

"Conservative amino add substitutions" are those substitutions fbat are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the fiinction of the 
protdn is conserved and not significantly changed by such substitutions. The table bdow shows ainiiio 
acids which may be substituted for an original amino acid in a protein and ^hich are regarded as 
conservative amino add substitutions. 



Original Residue    Conservative Substitution
Ala                 Gly, Ser
Arg                 His, Lys
Asn                 Asp, Gln, His
Asp                 Asn, Glu
Cys                 Ala, Ser
Gln                 Asn, Glu, His
Glu                 Asp, Gln, His
Gly                 Ala
His                 Asn, Arg, Gln, Glu
Ile                 Leu, Val
Leu                 Ile, Val
Lys                 Arg, Gln, Glu
Met                 Leu, Ile
Phe                 His, Met, Leu, Trp, Tyr
Ser                 Cys, Thr
Thr                 Ser, Val
Trp                 Phe, Tyr
Tyr                 His, Phe, Trp
Val                 Ile, Leu, Thr



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation,
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the
side chain.
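For illustration, the substitution table above can be held as a simple lookup, using standard three-letter residue codes. The dictionary and function names are hypothetical. The lookup is directional, following the table as written: Ala may be replaced by Ser, but Ser is not listed as replaceable by Ala.

```python
# Conservative substitutions, keyed by original residue (three-letter codes),
# transcribed from the table above. Names are illustrative only.
CONSERVATIVE = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the table lists `replacement` as a conservative substitute for `original`."""
    return replacement in CONSERVATIVE.get(original, set())


print(is_conservative("Ala", "Ser"))  # True
print(is_conservative("Ser", "Ala"))  # False: the table is not symmetric
```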

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical
modifications of a polynucleotide can include, for example, replacement of hydrogen by an alkyl, acyl,
hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains at least one
biological or immunological function of the natural molecule. A derivative polypeptide is one modified
by glycosylation, pegylation, or any similar process that retains at least one biological or immunological
function of the polypeptide from which it was derived.

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide.

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or
absent gene or protein expression, determined by comparing at least two different samples. Such
comparisons may be carried out between, for example, a treated and an untreated sample, or a diseased
and a normal sample.

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10,
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present
embodiments.
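As a rough sketch, the two structural conditions in this definition (identical in sequence to a contiguous stretch of the parent, and shorter than the parent by at least one residue) can be checked as below. The function name is hypothetical, and the check does not address whether the portion is "unique" within a genome or proteome.

```python
def is_fragment(candidate: str, parent: str) -> bool:
    """True if `candidate` is a contiguous stretch of `parent` and strictly shorter than it."""
    candidate, parent = candidate.upper(), parent.upper()
    return 0 < len(candidate) < len(parent) and candidate in parent


print(is_fragment("GTAC", "AGTACC"))    # True: contiguous and shorter
print(is_fragment("AGTACC", "AGTACC"))  # False: same length as the parent
print(is_fragment("GTT", "AGTACC"))     # False: not present in the parent
```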

A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in the
genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for
example, in hybridization and amplification technologies and in analogous methods that distinguish
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ
ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely
determinable by one of ordinary skill in the art based on the intended purpose for the fragment.

A fragment of SEQ ID NO:1-32 is encoded by a fragment of SEQ ID NO:33-64. A fragment
of SEQ ID NO:1-32 comprises a region of unique amino acid sequence that specifically identifies
SEQ ID NO:1-32. For example, a fragment of SEQ ID NO:1-32 is useful as an immunogenic peptide
for the development of antibodies that specifically recognize SEQ ID NO:1-32. The precise length of
a fragment of SEQ ID NO:1-32 and the region of SEQ ID NO:1-32 to which the fragment
corresponds are routinely determinable by one of ordinary skill in the art based on the intended
purpose for the fragment.

A "full length" polynucleotide sequence is one containing at least a translation initiation codon
(e.g., methionine) followed by an open reading frame and a translation termination codon. A "full
length" polynucleotide sequence encodes a "full length" polypeptide sequence.

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two
or more polynucleotide sequences or two or more polypeptide sequences.

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to
the percentage of residue matches between at least two polynucleotide sequences aligned using a
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in
the sequences being compared in order to optimize alignment between two sequences, and therefore
achieve a more meaningful comparison of the two sequences.

Percent identity between polynucleotide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence
alignment program. This program is part of the LASERGENE software package, a suite of molecular
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G.
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191.
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows:
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is
selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between
aligned polynucleotide sequences.
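By way of illustration only, the idea of "percentage of residue matches" over a gapped pairwise alignment can be sketched as follows. The function name, the "-" gap symbol, and the choice of denominator (every aligned column holding at least one residue) are assumptions made for this sketch; CLUSTAL V and BLAST apply their own scoring and reporting conventions, and this code does not reproduce either program.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Both inputs must already be aligned (equal length, gaps inserted).
    Columns that are gaps in both sequences are ignored (assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # column carries no residue in either sequence
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residues to compare")
    return 100.0 * matches / compared


# Four of six columns match; the gapped column counts as a mismatch.
print(round(percent_identity("ACGT-A", "ACCTGA"), 1))  # 66.7
```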

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several
sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis
programs including "blastn," that is used to align a known polynucleotide sequence with other
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap X drop-off: 50
Expect: 10
Word Size: 11
Filter: on

Percent identity may be measured over the length of an entire defined sequence, for example, as
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such
lengths are exemplary only, and it is understood that any fragment length supported by the sequences
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which
percentage identity may be measured.

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences
that all encode substantially the same protein.
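The effect of degeneracy can be seen in a small worked example. The sketch below builds the standard genetic code and translates two made-up nine-nucleotide sequences (they are not taken from the Sequence Listing). The two sequences agree at only two of nine positions, yet both encode Leu-Ser-Arg.

```python
# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"  # codons beginning with T
    "LLLLPPPPHHQQRRRR"  # codons beginning with C
    "IIIMTTTTNNKKSSRR"  # codons beginning with A
    "VVVVAAAADDEEGGGG"  # codons beginning with G
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, codon by codon, into one-letter amino acids."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


seq_a = "CTGTCACGT"  # hypothetical example sequence
seq_b = "TTAAGCAGA"  # hypothetical example sequence
identical = sum(x == y for x, y in zip(seq_a, seq_b))

print(translate(seq_a), translate(seq_b))  # LSR LSR
print(identical, "of", len(seq_a))         # 2 of 9
```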

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to
the percentage of residue matches between at least two polypeptide sequences aligned using a
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment
methods take into account conservative amino acid substitutions. Such conservative substitutions,
explained in more detail above, generally preserve the charge and hydrophobicity at the site of
substitution, thus preserving the structure (and therefore function) of the polypeptide.

Percent identity between polypeptide sequences may be determined using the default parameters
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment
program (described and referenced above). For pairwise alignments of polypeptide sequences using
CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap penalty=3, window=5, and
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity"
between aligned polypeptide sequence pairs.

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12
(April-21-2000) with blastp set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap X drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, for
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance,
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to
describe a length over which percentage identity may be measured.

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for
chromosome replication, segregation and maintenance.

The term "humanized antibody" refers to an antibody molecule in which the amino acid
sequence in the non-antigen binding regions has been altered so that the antibody more closely
resembles a human antibody, and still retains its original binding ability.

"Hybridization" refers to the process by which a polynucleotide strand anneals with a
complementary strand through base pairing under defined hybridization conditions. Specific
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity.
Specific hybridization complexes form under permissive annealing conditions and remain hybridized
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e.,
binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and
may be consistent among hybridization experiments, whereas wash conditions may be varied among
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v)
SDS, and about 100 µg/ml sheared, denatured salmon sperm DNA.

Generally, stringency of hybridization is expressed, in part, with reference to the temperature
under which the wash step is carried out. Such wash temperatures are typically selected to be about
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the
target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and conditions
for nucleic acid hybridization are well known and can be found in Sambrook, J. et al. (1989) Molecular
Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically
see volume 2, chapter 9.
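As a non-authoritative sketch, one widely quoted empirical approximation for long DNA:DNA hybrids is Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) - 0.63(% formamide) - 600/L, with Tm in °C, [Na+] in mol/l and L the duplex length in base pairs. Whether this is the exact form given in the cited chapter is an assumption, and the function names below are hypothetical. The second function applies the 5°C to 20°C window described above.

```python
import math


def melting_temp_c(seq: str, na_molar: float, formamide_pct: float = 0.0) -> float:
    """Approximate Tm (deg C) of a long DNA:DNA hybrid (assumed empirical formula)."""
    seq = seq.upper()
    length = len(seq)
    gc_pct = 100.0 * (seq.count("G") + seq.count("C")) / length
    return (
        81.5
        + 16.6 * math.log10(na_molar)
        + 0.41 * gc_pct
        - 0.63 * formamide_pct
        - 600.0 / length
    )


def wash_temp_range_c(tm_c: float) -> tuple:
    """Wash temperatures about 20 to 5 deg C below Tm, as (lowest, highest)."""
    return (tm_c - 20.0, tm_c - 5.0)


# Hypothetical 100-nucleotide probe with 50% G+C at 1 M Na+ and no formamide.
tm = melting_temp_c("ATGC" * 25, na_molar=1.0)
low, high = wash_temp_range_c(tm)
print(round(tm, 1), round(low, 1), round(high, 1))  # 96.0 76.0 91.0
```

Lowering the salt concentration or adding formamide lowers the computed Tm, which is consistent with the use of low-SSC washes and formamide in the stringent conditions described below.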

High stringency conditions for hybridization between polynucleotides of the present invention
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour.
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance,
sheared and denatured salmon sperm DNA at about 100-200 µg/ml. Organic solvent, such as
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances,
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is
strongly indicative of a similar role for the nucleotides and their encoded polypeptides.

The term "hybridization complex" refers to a complex formed between two nucleic acid
sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization
complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g.,
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells
or their nucleic acids have been fixed).

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence
resulting in the addition of one or more amino acid residues or nucleotides, respectively.

"Immune response" can refer to conditions associated with inflammation, trauma, immune
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of
various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular
and systemic defense systems.

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is
capable of eliciting an immune response when introduced into a living organism, for example, a
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of
TRICH which is useful in any of the antibody production methods disclosed herein or known in the art.

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides,
or other chemical compounds on a substrate.

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other
chemical compound having a unique and defined position on a microarray.

The term "modulate" refers to a change in the activity of TRICH. For example, modulation
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological,
functional, or immunological properties of TRICH.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide,
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably
linked to a coding sequence if the promoter affects the transcription or expression of the coding
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where
necessary to join two protein coding regions, in the same reading frame.

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and
may be pegylated to extend their lifespan in the cell.

"Post-translational modification" of a TRICH may involve lipidation, glycosylation,
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by
cell type depending on the enzymatic milieu of TRICH.

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid
sequence, e.g., by the polymerase chain reaction (PCR).

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100,
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may
be considerably longer than these examples, and it is understood that any length supported by the
specification, including the tables, figures, and Sequence Listing, may be used.

Methods for preparing and using probes and primers are described in the references, for
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular
Biology, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR
Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs
can be derived from a known sequence, for example, by using computer programs intended for that
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge
MA).

Oligonucleotides for use as primers are selected using software known in the art for such
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection
programs have incorporated additional features for expanded capabilities. For example, the PrimOU
primer selection program (available to the public from the Genome Center at University of Texas South
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection
program (available to the public from the Whitehead Institute/MIT Center for Genome Research,
Cambridge MA) allows the user to input a "mispriming library," in which sequences to avoid as primer
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for
microarrays. (The source code for the latter two primer selection programs may also be obtained from
their respective sources and modified to meet the user's specific needs.) The PrimeGen program
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences.
Hence, this program is useful for identification of both unique and conserved oligonucleotides and
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing
primers, microarray elements, or specific probes to identify fully or partially complementary
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to
those described above.

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence
that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence.
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal.

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions (UTRs).
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA
stability.

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid,
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent,
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and
other moieties known in the art.

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose.
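At the level of the written sequence, the definition above amounts to replacing every thymine with uracil. A minimal sketch (the function name is hypothetical; the change of sugar backbone is chemical and is not represented in a sequence string):

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence: same linear order, T replaced by U."""
    return dna.upper().replace("T", "U")


print(rna_equivalent("ATGCTT"))  # AUGCUU
```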

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH,
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA,
in solution or bound to a substrate; a tissue; a tissue print; etc.

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular structure
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will
reduce the amount of labeled A that binds to the antibody.

The term "substantially purified" refers to nucleic acid or amino acid sequences that are
removed from their natural environment and are isolated or separated, and are at least 60% free,
preferably at least 75% free, and most preferably at least 90% free from other components with which
they are naturally associated.

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by
different amino acid residues or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters,
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells,
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

A "transcript image" refers to the collective pattern of gene expression by a particular cell type
or tissue under given conditions at a given time.

"Transformation" describes a process by which exogenous DNA is introduced into a recipient
cell. Transformation may occur under natural or artificial conditions according to various methods well
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type
of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection,
electroporation, heat shock, lipofection, and particle bombardment. The term "transformed cells"
includes stably transformed cells in which the inserted DNA is capable of replication either as an
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed
cells which express the inserted DNA or RNA for limited periods of time.

A "transgenic organism," as used herein, is any organism, including but not limited to
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic
acid introduced by way of human intervention, such as by transgenic techniques well known in the
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The
transgenic organisms contemplated in accordance with the present invention include bacteria,
cyanobacteria, fungi, plants and animals. The isolated DNA of the present invention can be
introduced into the host by methods known in the art, for example infection, transfection,
transformation or transconjugation. Techniques for transferring the DNA of the present invention
into such organisms are widely known and provided in references such as Sambrook et al. (1989),
supra.

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999)
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at
least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence
identity over a certain defined length. A variant may be described as, for example, an "allelic" (as
defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may
possess additional functional domains or lack domains that are present in the reference molecule.
Species variants are polynucleotide sequences that vary from one species to another. The resulting
polypeptides will generally have significant amino acid identity relative to each other. A polymorphic
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be
indicative of, for example, a certain population, a disease state, or a propensity for a disease state.

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999)
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at
least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence identity over a
certain defined length of one of the polypeptides.
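
The identity thresholds above can be illustrated with a minimal sketch. It assumes two sequences that have already been aligned, with gaps written as "-", and computes percent identity over the aligned length. The function names percent_identity and is_variant are hypothetical, and this is not the BLAST 2 Sequences algorithm itself, which also builds the alignment and chooses the compared length.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Columns containing a gap ('-') are counted in the length but never as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def is_variant(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """True if identity meets the threshold (40% is the polypeptide floor stated in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Illustrative, made-up peptide fragments; 8 of the 10 aligned columns are identical.
    reference = "MKTLLVAG-L"
    candidate = "MKSLLVAGAL"
    print(percent_identity(reference, candidate))
    print(is_variant(reference, candidate, threshold=90.0))
```

Raising the threshold argument reproduces the stricter tiers listed in the text (50%, 60%, and so on up to 99%).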

THE INVENTION 

The invention is based on the discovery of new human transporters and ion channels (TRICH), 
the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, treatment, or
prevention of transport, neurological, muscle, immunological, and cell proliferative disorders.

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a
single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is denoted
by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is
denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and an
Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown.

Table 2 shows sequences with homology to the polypeptides of the invention as identified by
BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte
polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. Column 3
shows the GenBank identification number (GenBank ID NO:) of the nearest GenBank homolog.
Column 4 shows the probability score for the match between each polypeptide and its GenBank






WO 02/04520 PCT/US01/21448

homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations where
applicable, all of which are expressly incorporated by reference herein.

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and
2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte
polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column 3
shows the number of amino acid residues in each polypeptide. Column 4 shows potential
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS
program of the GCG sequence analysis software package (Genetics Computer Group, Madison WI).
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 7
shows analytical methods for protein structure/function analysis and in some cases, searchable
databases to which the analytical methods were applied.

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these
properties establish that the claimed polypeptides are transporters and ion channels. For example, SEQ
ID NO:5 is 83% identical to rat GABA receptor rho-3 subunit precursor (GenBank ID g1060975) as
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST
probability score is 1.7e-206, which indicates the probability of obtaining the observed polypeptide
sequence alignment by chance. SEQ ID NO:5 also contains a neurotransmitter-gated ion channel
domain as determined by searching for statistically significant matches in the hidden Markov model
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from
BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID
NO:5 is a neurotransmitter-gated ion channel. In an alternate example, SEQ ID NO:16 is 57%
identical to human Na+/glucose cotransporter (GenBank ID g338055) as determined by the Basic Local
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 2.4e-181, which
indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID
NO:16 also contains a sodium:solute symporter family domain as determined by searching for
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of
conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN
analyses provide further corroborative evidence that SEQ ID NO:16 is a Na+/glucose cotransporter. In
an alternate example, SEQ ID NO:27 is 53% identical to human ATP-binding cassette transporter-1
(ABC-1) (GenBank ID g4128033) as determined by the Basic Local Alignment Search Tool (BLAST).
(See Table 2.) The BLAST probability score is 0.0, which indicates the probability of obtaining the
observed polypeptide sequence alignment by chance. SEQ ID NO:27 also contains an ABC transporter
domain as determined by searching for statistically significant matches in the hidden Markov model
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from
BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID
NO:27 is an ABC transporter. In an alternate example, SEQ ID NO:12 is 45% identical to rat thyroid
sodium/iodide symporter NIS (GenBank ID g13919954) as determined by the Basic Local Alignment
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 3.0e-143, which indicates the
probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO:12 also
contains a sodium:solute symporter family domain as determined by searching for statistically
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein
family domains. (See Table 3.) Data from BLIMPS and PROFILESCAN analyses provide further
corroborative evidence that SEQ ID NO:12 is a sodium:solute symporter. SEQ ID NO:1-4, SEQ ID
NO:6-11, SEQ ID NO:13-15, SEQ ID NO:17-26, and SEQ ID NO:28-32 were analyzed and annotated
in a similar manner. The algorithms and parameters for the analysis of SEQ ID NO:1-32 are described
in Table 7.
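
The four worked examples above can be collected into a small data structure, which makes the relationship between percent identity and the BLAST probability score easier to compare. This is only an illustrative sketch: the BlastHit record, the significant_hits helper, and any cutoff passed to it are assumptions introduced here and are not part of the described analysis. The values are those quoted in the text.

```python
from typing import List, NamedTuple


class BlastHit(NamedTuple):
    seq_id_no: int         # Polypeptide SEQ ID NO
    homolog: str           # annotation of the nearest GenBank homolog
    percent_identity: int  # percent identity quoted in the text
    probability: float     # BLAST probability score (chance of a random alignment)


# Values quoted in the worked examples above.
HITS = [
    BlastHit(5, "rat GABA receptor rho-3 subunit precursor", 83, 1.7e-206),
    BlastHit(16, "human Na+/glucose cotransporter", 57, 2.4e-181),
    BlastHit(27, "human ATP-binding cassette transporter-1 (ABC-1)", 53, 0.0),
    BlastHit(12, "rat thyroid sodium/iodide symporter NIS", 45, 3.0e-143),
]


def significant_hits(hits: List[BlastHit], max_probability: float) -> List[BlastHit]:
    """Keep hits at or below a (hypothetical) probability cutoff, most significant first."""
    kept = [h for h in hits if h.probability <= max_probability]
    return sorted(kept, key=lambda h: h.probability)


if __name__ == "__main__":
    for hit in significant_hits(HITS, max_probability=1e-100):
        print(hit.seq_id_no, hit.homolog, hit.probability)
```

Note that a lower probability score does not track percent identity directly: SEQ ID NO:27 has the lowest score (0.0) at 53% identity, since the score also depends on the length of the alignment.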

As shown in Table 4, the full length polynucleotide sequences of the present invention were
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention.
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments of
the polynucleotide sequences which are useful, for example, in hybridization or amplification
technologies that identify SEQ ID NO:33-64 or that distinguish between SEQ ID NO:33-64 and
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5')
and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective
full length sequences.

The identification numbers in Column 5 of Table 4 may refer specifically, for example, to
Incyte cDNAs along with their corresponding cDNA libraries. For example, 6724643H1 is the
identification number of an Incyte cDNA sequence, and LUNLTMT01 is the cDNA library from
which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from
pooled cDNA libraries (e.g., 71495515V1). Alternatively, the identification numbers in column 5 may
refer to GenBank cDNAs or ESTs (e.g., g5746200) which contributed to the assembly of the full length
polynucleotide sequences. In addition, the identification numbers in column 5 may identify sequences
derived from the ENSEMBL (The Sanger Centre, Cambridge, UK) database (i.e., those sequences


including the designation "ENST"). Alternatively, the identification numbers in column 5 may be
derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences including
the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., those sequences
including the designation "NP"). Alternatively, the identification numbers in column 5 may refer to
assemblages of both cDNA and Genscan-predicted exons brought together by an "exon stitching"
algorithm. For example, FL_XXXXXX_N1_N2_YYYYY_N3_N4 represents a "stitched" sequence in
which XXXXXX is the identification number of the cluster of sequences to which the algorithm was
applied, and YYYYY is the number of the prediction generated by the algorithm, and N1,2,3..., if present,
represent specific exons that may have been manually edited during analysis (See Example V).
Alternatively, the identification numbers in column 5 may refer to assemblages of exons brought
together by an "exon-stretching" algorithm. For example, FLXXXXXX_gAAAAA_gBBBBB_1_N is the
identification number of a "stretched" sequence, with XXXXXX being the Incyte project identification
number, gAAAAA being the GenBank identification number of the human genomic sequence to which
the "exon-stretching" algorithm was applied, gBBBBB being the GenBank identification number or
NCBI RefSeq identification number of the nearest GenBank protein homolog, and N referring to
specific exons (See Example V). In instances where a RefSeq sequence was used as a protein homolog
for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM," "NP," or "NT") may be
used in place of the GenBank identifier (i.e., gBBBBB).

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from
genomic DNA sequences, or derived from a combination of sequence analysis methods. The following
Table lists examples of component sequence prefixes and corresponding sequence analysis methods
associated with the prefixes (see Example IV and Example V).



Prefix            Type of analysis and/or examples of programs

GNN, GFG, ENST    Exon prediction from genomic sequences using, for example,
                  GENSCAN (Stanford University, CA, USA) or FGENES
                  (Computer Genomics Group, The Sanger Centre, Cambridge, UK).

GBI               Hand-edited analysis of genomic sequences.

FL                Stitched or stretched genomic sequences (see Example V).
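
The prefix convention in the table above amounts to a simple lookup. The sketch below is one way such a lookup could be written; the function name classify_component and the identifiers used in the demonstration are hypothetical and are shown only to illustrate the mapping, which is taken from the table.

```python
# Prefix-to-method mapping, taken from the table above.
PREFIX_METHODS = {
    "GNN": "exon prediction from genomic sequences",
    "GFG": "exon prediction from genomic sequences",
    "ENST": "exon prediction from genomic sequences",
    "GBI": "hand-edited analysis of genomic sequences",
    "FL": "stitched or stretched genomic sequences",
}


def classify_component(identifier: str) -> str:
    """Return the analysis method implied by an identifier's prefix, or 'unknown'."""
    # Try longer prefixes first so a long prefix is never shadowed by a shorter one.
    for prefix in sorted(PREFIX_METHODS, key=len, reverse=True):
        if identifier.startswith(prefix):
            return PREFIX_METHODS[prefix]
    return "unknown"


if __name__ == "__main__":
    # Made-up identifiers that follow the naming patterns described in the text.
    for ident in ("ENST00000000001", "FL123456_g111111_g222222_1_2", "6724643H1"):
        print(ident, "->", classify_component(ident))
```

Identifiers without one of the listed prefixes, such as Incyte cDNA numbers or GenBank "g" numbers, fall through to "unknown" in this sketch and would need their own rules.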



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in column
5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte cDNA
identification numbers are not shown.

Table 5 shows the representative cDNA libraries for those full length polynucleotide sequences
which were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte
cDNA library which is most frequently represented by the Incyte cDNA sequences which were used to
assemble and confirm the above polynucleotide sequences. The tissues and vectors which were used to
construct the cDNA libraries shown in Table 5 are described in Table 6.

The invention also encompasses TRICH variants. A preferred TRICH variant is one which has
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence
identity to the TRICH amino acid sequence, and which contains at least one functional or structural
characteristic of TRICH.

The invention also encompasses polynucleotides which encode TRICH. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from
the group consisting of SEQ ID NO:33-64, which encodes TRICH. The polynucleotide sequences of
SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA sequences,
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone
is composed of ribose instead of deoxyribose.
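
At the level of sequence text, the DNA-to-RNA equivalence described above is a single substitution of T with U. A minimal sketch follows; the function name dna_to_rna and the example sequence are illustrative assumptions, and the ribose backbone has no representation in a plain sequence string.

```python
def dna_to_rna(dna: str) -> str:
    """Return the equivalent RNA sequence by replacing thymine (T) with uracil (U)."""
    seq = dna.upper()
    # Accept the four DNA bases plus N for an undetermined base.
    invalid = set(seq) - set("ACGTN")
    if invalid:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(invalid)}")
    return seq.replace("T", "U")


if __name__ == "__main__":
    print(dna_to_rna("ATGTTTGGC"))
```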

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence
encoding TRICH. A particular aspect of the invention encompasses a variant of a polynucleotide
sequence comprising a sequence selected from the group consisting of SEQ ID NO:33-64 which has at
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:33-64.
Any one of the polynucleotide variants described above can encode an amino acid sequence which
contains at least one functional or structural characteristic of TRICH.

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic
code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal similarity to the
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the
invention contemplates each and every possible variation of polynucleotide sequence that could be made
by selecting combinations based on possible codon choices. These combinations are made in
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally
occurring TRICH, and all such variations are to be considered as being specifically disclosed.
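
The scale of the degeneracy described above can be made concrete with a short sketch that counts, and for short peptides enumerates, the DNA sequences encoding a given amino acid sequence under the standard genetic code. The helper names count_encodings and all_encodings and the example peptide are assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Group synonymous codons by the amino acid they encode.
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA coding sequences for a peptide (product of codon choices)."""
    total = 1
    for aa in peptide.upper():
        total *= len(CODONS_FOR[aa])
    return total


def all_encodings(peptide: str):
    """Yield every coding sequence; only practical for very short peptides."""
    for combo in product(*(CODONS_FOR[aa] for aa in peptide.upper())):
        yield "".join(combo)


if __name__ == "__main__":
    print(count_encodings("MKL"))       # Met has 1 codon, Lys 2, Leu 6
    print(sorted(all_encodings("MKL")))
```

Because the count is a product over residues, it grows very quickly with peptide length, which is why the text speaks of "a multitude" of encoding sequences.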

Although nucleotide sequences which encode TRICH and its variants are generally capable of
hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately selected
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TRICH or
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally
occurring codons. Codons may be selected to increase the rate at which expression of the peptide
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide
sequence encoding TRICH and its derivatives without altering the encoded amino acid sequences
include the production of RNA transcripts having more desirable properties, such as a greater half-life,
than transcripts produced from the naturally occurring sequence.

The invention also encompasses production of DNA sequences which encode TRICH and
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the
synthetic sequence may be inserted into any of the many available expression vectors and cell systems
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce
mutations into a sequence encoding TRICH or any fragment thereof.

Also encompassed by the invention are polynucleotide sequences that are capable of
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in
"Definitions."

Methods for DNA sequencing are well known in the art and may be used to practice any of the
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV),
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler
(Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing
system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics,
Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a
variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols
in Molecular Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular
Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.)

The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide
sequence and employing various PCR-based methods known in the art to detect upstream sequences,
such as promoters and regulatory elements. For example, one method which may be employed,
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.)
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown
sequence from a circularized template. The template is derived from restriction fragments comprising a
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al.
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and
ligations may be used to insert an engineered double-stranded sequence into a region of unknown
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060).

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in
finding intron/exon junctions. For all PCR-based methods, primers may be designed using
commercially available software, such as OLIGO 4.06 primer analysis software (National Biosciences,
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to
72°C.
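
Two of the primer design criteria stated above, length and GC content, can be checked directly from the primer sequence. The sketch below is an assumption-laden illustration, not the OLIGO 4.06 procedure: the function names and the example primers are made up, the text's limits are approximate ("about"), and the annealing temperature criterion is not evaluated here because it depends on the melting-temperature model used.

```python
def gc_content(primer: str) -> float:
    """Percent of bases in the primer that are G or C."""
    seq = primer.upper()
    if not seq:
        raise ValueError("primer is empty")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def meets_design_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 50.0) -> bool:
    """Check the length (about 22 to 30 nt) and GC content (about 50% or more) criteria."""
    return min_len <= len(primer) <= max_len and gc_content(primer) >= min_gc


if __name__ == "__main__":
    # Made-up primer sequences used only to exercise the checks.
    for primer in ("GCGCATATGCGCATATGCGCATAT", "ATATATATATATATATATATATAT", "GCGCGCGCGC"):
        print(primer, len(primer), gc_content(primer), meets_design_criteria(primer))
```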

When screening for full length cDNAs, it is preferable to use libraries that have been
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5'
non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze the
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection
of the emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire
process from loading of samples to computer analysis and electronic data display may be computer
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments
which may be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments thereof which
encode TRICH may be cloned in recombinant DNA molecules that direct expression of TRICH, or
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent
amino acid sequence may be produced and used to express TRICH.

The nucleotide sequences of the present invention can be engineered using methods generally
known in the art in order to alter TRICH-encoding sequences for a variety of purposes including, but
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic
oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new
restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.

The nucleotides of the present invention may be subjected to DNA shuffling techniques such
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or
improve the biological properties of TRICH, such as its biological or enzymatic activity or its ability
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene
variants is produced using PCR-mediated recombination of gene fragments. The library is then
subjected to selection or screening procedures that identify those gene variants with the desired
properties. These preferred variants may then be pooled and further subjected to recursive rounds of
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing random
point mutations may be recombined, screened, and then reshuffled until the desired properties are
optimized. Alternatively, fragments of a given gene may be recombined with fragments of
homologous genes in the same gene family, either from the same or different species, thereby
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable
manner.

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in part,
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively,
TRICH itself or a fragment thereof may be synthesized using chemical methods. For example, peptide
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g.,
Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New York NY, pp.
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved
using the ABI 431A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence
of TRICH, or any part thereof, may be altered during direct synthesis and/or combined with sequences
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a
sequence of a naturally occurring polypeptide.

The peptide may be substantially purified by preparative high performance liquid
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing.
(See, e.g., Creighton, supra, pp. 28-53.)

In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH or
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains
the necessary elements for transcriptional and translational control of the inserted coding sequence in a
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences
encoding TRICH. Such elements may vary in their strength and specificity. Specific initiation signals
may also be used to achieve more efficient translation of sequences encoding TRICH. Such signals
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where
sequences encoding TRICH and its initiation codon and upstream regulatory sequences are inserted into
the appropriate expression vector, no additional transcriptional or translational control signals may be
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous
translational control signals including an in-frame ATG initiation codon should be provided by the
vector. Exogenous translational elements and initiation codons may be of various origins, both natural
and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ.
20:125-162.)

Methods which are well known to those skilled in the art may be used to construct expression
vectors containing sequences encoding TRICH and appropriate transcriptional and translational control
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995)
Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, ch. 9, 13, and 16.)

A variety of expression vector/host systems may be utilized to contain and express sequences
encoding TRICH. These include, but are not limited to, microorganisms such as bacteria transformed
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or
animal cell systems. (See, e.g., Sambrook, supra; Ausubel, supra; Van Heeke, G. and S.M. Schuster
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO
J. 6:307-311; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New
York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and
Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses,
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci.
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al.
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.)
The invention is not limited by the host cell employed.

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine cloning,
subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved using a
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid
(Life Technologies). Ligation of sequences encoding TRICH into the vector's multiple cloning site
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested
deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem.
264:5503-5509.) When large quantities of TRICH are needed, e.g. for the production of antibodies,
vectors which direct high level expression of TRICH may be used. For example, vectors containing the
strong, inducible SP6 or T7 bacteriophage promoter may be used.

Yeast expression systems may be used for production of TRICH. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184.)

Plant systems may also be used for expression of TRICH. Transcription of sequences encoding
TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311).
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be
used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science
224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These constructs can
be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See,
e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp.
191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated into an
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based
vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers,
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.)

For long term production of recombinant proteins in mammalian systems, stable expression of
TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed into cell
lines using expression vectors which may contain viral origins of replication and/or endogenous
expression elements and a selectable marker gene on the same or on a separate vector. Following the
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before
being switched to selective media. The purpose of the selectable marker is to confer resistance to a
selective agent, and its presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue
culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines. These include,
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase
genes, for use in tk- and apr- cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 11:223-232;
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to
chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980)
Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.)
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), β
glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A.
(1995) Methods Mol. Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of interest is
also present, the presence and expression of the gene may need to be confirmed. For example, if the
sequence encoding TRICH is inserted within a marker gene sequence, transformed cells containing
sequences encoding TRICH can be identified by the absence of marker gene function. Alternatively, a
marker gene can be placed in tandem with a sequence encoding TRICH under the control of a single
promoter. Expression of the marker gene in response to induction or selection usually indicates
expression of the tandem gene as well.

In general, host cells that contain the nucleic acid sequence encoding TRICH and that express
TRICH may be identified by a variety of procedures known to those of skill in the art. These
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of TRICH using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal
antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a competitive binding
assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et
al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect. IV; Coligan, J.E.
et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and Wiley-Interscience, New
York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization
or PCR probes for detecting sequences related to polynucleotides encoding TRICH include
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned into a vector for
the production of an mRNA probe. Such vectors are known in the art, are commercially available, and
may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as
T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as
well as substrates, cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under
conditions suitable for the expression and recovery of the protein from cell culture. The protein
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing
polynucleotides which encode TRICH may be designed to contain signal sequences which direct
secretion of TRICH through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation,
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells
which have specific cellular machinery and characteristic mechanisms for post-translational activities
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing
of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein
containing a heterologous moiety that can be recognized by a commercially available antibody may
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein and
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins,
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site
located between the TRICH encoding sequence and the heterologous protein sequence, so that TRICH
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein
expression and purification are discussed in Ausubel (1995, supra, ch. 10). A variety of commercially
available kits may also be used to facilitate expression and purification of fusion proteins.
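
As an illustrative aid only, the tag and matrix pairings listed above can be collected in a small
lookup table. The following Python sketch restates those pairings; the names PURIFICATION_MATRIX,
EPITOPE_TAGS, and purification_route are hypothetical and are not part of the disclosure.

```python
# Affinity tag -> purification matrix, as paired in the text above.
PURIFICATION_MATRIX = {
    "GST": "immobilized glutathione",
    "MBP": "immobilized maltose",
    "Trx": "immobilized phenylarsine oxide",
    "CBP": "immobilized calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags that the text pairs with immunoaffinity purification.
EPITOPE_TAGS = {"FLAG", "c-myc", "HA"}


def purification_route(tag):
    """Return the purification route the text associates with a fusion tag."""
    if tag in PURIFICATION_MATRIX:
        return PURIFICATION_MATRIX[tag]
    if tag in EPITOPE_TAGS:
        return "immunoaffinity purification with a tag-specific antibody"
    raise KeyError(f"no purification route listed for tag {tag!r}")
```

A tag that is not named in the text raises KeyError rather than returning a guess.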

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved in
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for
example, 35S-methionine.

TRICH of the present invention or fragments thereof may be used to screen for compounds
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be
screened for specific binding to TRICH. Examples of test compounds include antibodies,
oligonucleotides, proteins (e.g., receptors), or small molecules.

In one embodiment, the compound thus identified is closely related to the natural ligand of
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a
natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2):
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the
compound can be rationally designed using known techniques. In one embodiment, screening for
these compounds involves producing appropriate cells which express TRICH, either as a secreted
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila, or
E. coli. Cells expressing TRICH or cell membrane fractions which contain TRICH are then contacted
with a test compound and binding, stimulation, or inhibition of activity of either TRICH or the
compound is analyzed.

An assay may simply test binding of a test compound to the polypeptide, wherein binding is
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example,
the assay may comprise the steps of combining at least one test compound with TRICH, either in
solution or affixed to a solid support, and detecting the binding of TRICH to the compound.
Alternatively, the assay may detect or measure binding of a test compound in the presence of a
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a
solid support.

TRICH of the present invention or fragments thereof may be used to screen for compounds
that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for TRICH
activity, wherein TRICH is combined with at least one test compound, and the activity of TRICH in the
presence of a test compound is compared with the activity of TRICH in the absence of the test
compound. A change in the activity of TRICH in the presence of the test compound is indicative of a
compound that modulates the activity of TRICH. Alternatively, a test compound is combined with an
in vitro or cell-free system comprising TRICH under conditions suitable for TRICH activity, and the
assay is performed. In either of these assays, a test compound which modulates the activity of TRICH
may do so indirectly and need not come in direct contact with TRICH. At least one and up to
a plurality of test compounds may be screened.
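
The comparison described above, activity in the presence of a test compound against activity in
its absence, can be sketched in a few lines of Python. This is a minimal illustration under stated
assumptions, not a validated assay analysis: the function names, the use of replicate means, and the
20% threshold are all hypothetical choices that do not come from the disclosure.

```python
from statistics import mean


def fold_change(activity_with, activity_without):
    """Ratio of mean activity with the test compound to mean activity without it."""
    baseline = mean(activity_without)
    if baseline == 0:
        raise ValueError("baseline activity is zero; fold change is undefined")
    return mean(activity_with) / baseline


def classify(activity_with, activity_without, threshold=0.2):
    """Label a test compound by the direction of the change in activity.

    threshold is an assumed cutoff (20% change); a real screen would set it
    from replicate variability and appropriate statistics.
    """
    fc = fold_change(activity_with, activity_without)
    if fc >= 1 + threshold:
        return "increases activity"
    if fc <= 1 - threshold:
        return "decreases activity"
    return "no change detected"
```

For example, classify([50, 50], [100, 100]) reports a decrease, since the mean activity with the
compound is half the mean activity without it.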

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs may
be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES)
cells. Such techniques are well known in the art and are useful for the generation of animal models of
human disease. (See, e.g., U.S. Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo
and grown in culture. The ES cells are transformed with a vector containing the gene of interest
disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989)
Science 244:1288-1292). The vector integrates into the corresponding region of the host genome by
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D.
(1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330).
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the
resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains.
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents.

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. (1998)
Science 282:1145-1147).

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region
of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected sequence
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, a
mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also serve as a
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74).

THERAPEUTICS

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists
between regions of TRICH and transporters and ion channels. In addition, the expression of TRICH
is closely associated with adrenal, testicular, and prostate tumors, Crohn's disease, teratocarcinoma
and dendritic cells, brain, lung, ileum, small intestine, uterine myometrial, colon, and pancreatic
tissues. Therefore, TRICH appears to play a role in transport, neurological, muscle, immunological,
and cell proliferative disorders. In the treatment of disorders associated with increased TRICH
expression or activity, it is desirable to decrease the expression or activity of TRICH. In the treatment
of disorders associated with decreased TRICH expression or activity, it is desirable to increase the
expression or activity of TRICH.

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus,
diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic
periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia
gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrhythmia,
tachyarrhythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline
myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy,
ethanol myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis,
neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder,
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other
disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal
neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Graves' disease, goiter,
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome,
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn
syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms,
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other
extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous
system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis,
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD),
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses,
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration,
and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis,
Duchenne's muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, central core
disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy,
infectious myositis, polymyositis, dermatomyositis, inclusion body myositis, thyrotoxic myopathy,
ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and
myopathies including encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic
disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease,
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma,
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema,
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis,
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis,
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis,
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder,
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis,
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus.

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative
thereof may be administered to a subject to treat or prevent a disorder associated with decreased
expression or activity of TRICH including, but not limited to, those described above.

In a further embodiment, a composition comprising a substantially purified TRICH in
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a
disorder associated with decreased expression or activity of TRICH including, but not limited to, those
provided above.

In still another embodiment, an agonist which modulates the activity of TRICH may be
administered to a subject to treat or prevent a disorder associated with decreased expression or activity
of TRICH including, but not limited to, those listed above.

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat or
prevent a disorder associated with increased expression or activity of TRICH. Examples of such
disorders include, but are not limited to, those transport, neurological, muscle, immunological, and cell
proliferative disorders described above. In one aspect, an antibody which specifically binds TRICH
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a
pharmaceutical agent to cells or tissues which express TRICH.

In an additional embodiment, a vector expressing the complement of the polynucleotide
encoding TRICH may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of TRICH including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary
sequences, or vectors of the invention may be administered in combination with other appropriate
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination
of therapeutic agents may act synergistically to effect the treatment or prevention of the various
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with
lower dosages of each agent, thus reducing the potential for adverse side effects.

An antagonist of TRICH may be produced using methods which are generally known in the art.
In particular, purified TRICH may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may
also be generated using methods that are well known in the art. Such antibodies may include, but are
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit
dimer formation) are generally preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans,
and others may be immunized by injection with TRICH or with any fragment or oligopeptide thereof
which has immunogenic properties. Depending on the host species, various adjuvants may be used to
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG
(bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches of
TRICH amino acids may be fused with those of another protein, such as KLH, and antibodies to the
chimeric molecule may be produced.

Monoclonal antibodies to TRICH may be prepared using any technique which provides for the
production of antibody molecules by continuous cell lines in culture. These include, but are not limited
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda,
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single
chain antibodies may be adapted, using methods known in the art, to produce TRICH-specific single
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton,
D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte population
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter,
G. et al. (1991) Nature 349:293-299.)

Antibody fragments which contain specific binding sites for TRICH may also be generated.
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the
F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al.
(1989) Science 246:1275-1281.)

Various immunoassays may be used for screening to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between TRICH and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive
to two non-interfering TRICH epitopes is generally used, but a competitive binding assay may also be
employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques
may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an association
constant, Ka, which is defined as the molar concentration of TRICH-antibody complex divided by the
molar concentrations of free antigen and free antibody under equilibrium conditions. The Ka determined
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple
TRICH epitopes, represents the average affinity, or avidity, of the antibodies for TRICH. The Ka
determined for a preparation of monoclonal antibodies, which are monospecific for a particular TRICH
epitope, represents a true measure of affinity. High-affinity antibody preparations with Ka ranging from
about 10^9 to 10^12 L/mole are preferred for use in immunoassays in which the TRICH-antibody complex
must withstand rigorous manipulations. Low-affinity antibody preparations with Ka ranging from
about 10^6 to 10^7 L/mole are preferred for use in immunopurification and similar procedures which
ultimately require dissociation of TRICH, preferably in active form, from the antibody (Catty, D.
(1988) Antibodies, Volume I: A Practical Approach, IRL Press, Washington DC; Liddell, J.E. and A.
Cryer (1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY).

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine
the quality and suitability of such preparations for certain downstream applications. For example, a
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg
specific antibody/ml, is generally employed in procedures requiring precipitation of TRICH-antibody
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra, and
Coligan et al. supra.)
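
The association constant discussed above, defined as the concentration of complex divided by the
concentrations of free antigen and free antibody, can be estimated from equilibrium binding data by a
Scatchard fit. The following Python sketch is a minimal illustration that assumes a single class of
binding sites, for which bound/free = Ka x (Bmax - bound). The function name scatchard_fit, the symbol
Bmax (total binding-site concentration), and the use of ordinary least squares are assumptions made for
illustration and are not taken from the disclosure.

```python
def scatchard_fit(bound, free):
    """Estimate Ka (L/mol) and Bmax (mol/L) from a Scatchard plot.

    bound, free: equal-length sequences of molar concentrations of bound and
    free antigen at equilibrium. Fits bound/free against bound by ordinary
    least squares; the slope is -Ka and the x-intercept is Bmax.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired (bound, free) measurements")
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    if sxx == 0:
        raise ValueError("bound concentrations must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope
    bmax = intercept / ka
    return ka, bmax
```

For a polyclonal preparation the Scatchard plot is generally curved, so a single straight-line fit of
this kind yields only an average value, consistent with the distinction drawn above between avidity
and true affinity.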

In another embodiment of the invention, the polynucleotides encoding TRICH, or any fragment
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA,
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding TRICH.
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be
designed from various locations along the coding or control regions of sequences encoding TRICH.
(See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc., Totowa NJ.)
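
By way of illustration only, the design of a complementary (antisense) sequence against a chosen region of a transcript may be sketched as follows. The function name, the example sequence, and the window position are hypothetical and are not taken from any sequence of the invention. The sketch computes only the reverse complement of the selected window and does not address oligonucleotide chemistry, specificity, or target accessibility.

```python
# Illustrative sketch: derive an antisense DNA oligonucleotide for a window
# of a target mRNA. The sequence and window used below are hypothetical.

# Watson-Crick pairing from an RNA target base to the DNA base that pairs with it.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(mrna: str, start: int, length: int) -> str:
    """Return the DNA oligo, written 5'->3', complementary to mrna[start:start+length]."""
    target = mrna.upper()[start:start + length]
    if start < 0 or len(target) != length:
        raise ValueError("window does not fit inside the sequence")
    # Complement each base and reverse the order so the oligo reads 5'->3'.
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(target))


example_mrna = "AUGGCUUCAGGAUCC"  # hypothetical transcript fragment
oligo = antisense_dna(example_mrna, 0, 9)
print(oligo)
```

In practice, several windows along the coding or control regions would be generated in this way and compared experimentally.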
In therapeutic use, any gene delivery system suitable for introduction of the antisense
sequences into appropriate target cells can be used. Antisense sequences can be delivered
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.,
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995)
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood
76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(1):217-225; Boado, R.J. et
al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res.
25(14):2730-2736.)

In another embodiment of the invention, polynucleotides encoding TRICH may be used for
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475),
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal,
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii)
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g.,
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988)
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci.),
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In the
case where a genetic deficiency in TRICH expression or regulation causes disease, the expression of
TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations
caused by the genetic deficiency.
In a further embodiment of the invention, diseases or disorders caused by deficiencies in
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v)
the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217;
Ivics, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Récipon (1998) Curr. Opin. Biotechnol.
9:445-450).

Expression vectors that may be effective for the expression of TRICH include, but are not
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA),
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF,
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH may be expressed
using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus
(RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible promoter (e.g., the
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998)
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V.
and Blau, H.M. supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous
gene encoding TRICH from a normal individual.

Commercially available liposome transformation kits (e.g., the PERFECT LIPID
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver
polynucleotides to target cells in culture and require minimal effort to optimize experimental
parameters. In the alternative, transformation is performed using the calcium phosphate method
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al.
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these
standardized mammalian transfection protocols.

In another embodiment of the invention, diseases or disorders caused by genetic defects with
respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc.
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an
appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al.
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R.
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a
method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference.

Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+ T-cells), and the
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al.
(1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290).

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with respect to
the expression of TRICH. The construction and packaging of adenovirus-based vectors are well known
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be
versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"),
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu.
Rev. Nutr. 19:511-544 and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both
incorporated by reference herein.

In another alternative, a herpes-based, gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with
respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV has a
tropism. The construction and packaging of herpes-based vectors are well known to those with
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res.
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S.
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant HSV
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22.
For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) Dev.
Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus
sequences, the generation of recombinant virus following the transfection of multiple plasmids
containing different segments of the large herpesvirus genomes, the growth and propagation of
herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary
skill in the art.

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus,
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on
the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, resulting
in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g.,
protease and polymerase). Similarly, inserting the coding sequence for TRICH into the alphavirus
genome in place of the capsid-coding region results in the production of a large number of
TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While alphavirus
infection is typically associated with cell lysis within a few days, the ability to establish a persistent
infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application






WO 02/04520 PCT/US01/21448

(Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the
introduction of TRICH into a variety of cell types. The specific transduction of a subset of cells in a
population may require the sorting of cells prior to transduction. The methods of manipulating
infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and
performing alphavirus infections, are well known to those with ordinary skill in the art.

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases,
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr,
Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177.) A
complementary sequence or antisense molecule may also be designed to block translation of mRNA by
preventing the transcript from binding to ribosomes.

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example,
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze
endonucleolytic cleavage of sequences encoding TRICH.

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA,
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides,
corresponding to the region of the target gene containing the cleavage site, may be evaluated for
secondary structural features which may render the oligonucleotide inoperable. The suitability of
candidate targets may also be evaluated by testing accessibility to hybridization with complementary
oligonucleotides using ribonuclease protection assays.
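
The scanning step described above may be sketched as follows. This is a minimal illustration under stated assumptions: the function names and the example sequence are hypothetical, and centering the candidate window on the cleavage triplet is an arbitrary choice made for the example. Evaluation of secondary structure and of accessibility is not modeled.

```python
# Illustrative sketch: locate GUA, GUU, and GUC triplets in a target RNA and
# extract a 15-20 nt candidate window around each site. Names are hypothetical.

CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna: str) -> list[tuple[int, str]]:
    """Return (0-based position, triplet) for every cleavage triplet in rna."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    return [
        (i, rna[i:i + 3])
        for i in range(len(rna) - 2)
        if rna[i:i + 3] in CLEAVAGE_TRIPLETS
    ]


def candidate_window(rna: str, site: int, length: int = 17) -> str:
    """Return a window of `length` nt that contains the triplet starting at `site`.

    The window is centered on the middle base of the triplet where possible
    and shifted inward at the ends of the sequence (an assumption of this sketch).
    """
    rna = rna.upper().replace("T", "U")
    if not 15 <= length <= 20:
        raise ValueError("length must be between 15 and 20 ribonucleotides")
    if length > len(rna):
        raise ValueError("sequence is shorter than the requested window")
    start = site + 1 - length // 2
    start = max(0, min(start, len(rna) - length))
    return rna[start:start + length]


example_rna = "GGAUCCGUCAAGCUUGGGUAACCCGGGAUUU"  # hypothetical target fragment
sites = find_cleavage_sites(example_rna)
windows = [candidate_window(example_rna, pos) for pos, _ in sites]
```

Each window returned in this way would then be assessed for secondary structure and tested for accessibility, as described above.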

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences
encoding TRICH. Such DNA sequences may be incorporated into a wide variety of vectors with
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that
synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or
tissues.

RNA molecules may be modified to increase intracellular stability and half-life. Possible
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine,
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.

An additional embodiment of the invention encompasses a method for screening for a
compound which is effective in altering expression of a polynucleotide encoding TRICH.
Compounds which may be effective in altering expression of a specific polynucleotide may include,
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and
non-macromolecular chemical entities which are capable of interacting with specific polynucleotide
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased
TRICH expression or activity, a compound which specifically inhibits expression of the
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders
associated with decreased TRICH expression or activity, a compound which specifically promotes
expression of the polynucleotide encoding TRICH may be therapeutically useful.

At least one, and up to a plurality, of test compounds may be screened for effectiveness in
altering expression of a specific polynucleotide. A test compound may be obtained by any method
commonly known in the art, including chemical modification of a compound known to be effective in
altering polynucleotide expression; selection from an existing, commercially-available or proprietary
library of naturally-occurring or non-natural chemical compounds; rational design of a compound
based on chemical and/or structural properties of the target polynucleotide; and selection from a
library of chemical compounds created combinatorially or randomly. A sample comprising a
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted
biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed
by any method commonly known in the art. Typically, the expression of a specific nucleotide is
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus
forming the basis for a comparison of the expression of the polynucleotide both with and without
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide
exposed to a test compound indicates that the test compound is effective in altering the expression of
the polynucleotide. A screen for a compound effective in altering expression of a specific
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids
Res. 28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys.
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S.
Patent No. 6,022,691).

Many methods for introducing vectors into cells or tissues are available and equally suitable for
use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken
from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat.
Biotechnol. 15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in need of such
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and
monkeys.

An additional embodiment of the invention relates to the administration of a composition which
generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient.
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of TRICH,
antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH.

The compositions utilized in this invention may be administered by any number of routes
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal,
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical,
sublingual, or rectal means.

Compositions for pulmonary administration may be prepared in liquid or dry powder form.
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the case
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting
formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and proteins),
recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled
the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. et al., U.S.
Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle
injection, and obviates the need for potentially toxic penetration enhancers.

Compositions suitable for use in the invention include compositions wherein the active
ingredients are contained in an effective amount to achieve the intended purpose. The determination of
an effective dose is well within the capability of those skilled in the art.

Specialized forms of compositions may be prepared for direct intracellular delivery of
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the
macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic
N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et
al. (1999) Science 285:1569-1572).

For any compound, the therapeutically effective dose can be estimated initially either in cell
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys,
or pigs. An animal model may also be used to determine the appropriate concentration range and route
of administration. Such information can then be used to determine useful doses and routes for
administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example TRICH
or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH, which
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the
therapeutic index, which can be expressed as the LD50/ED50 ratio. Compositions which exhibit large
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are
used to formulate a range of dosage for human use. The dosage contained in such compositions is
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity.
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the
patient, and the route of administration.
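
As a worked illustration of the ratio defined above, the therapeutic index may be computed as follows. The function name and the dose values are hypothetical and are chosen only to show the arithmetic; they are not data from any study.

```python
# Illustrative sketch: therapeutic index as the LD50/ED50 ratio.
# The doses below are invented numbers in arbitrary but identical units.


def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return LD50 / ED50; both doses must be positive and in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical compound: lethal to 50% at 500 units, effective in 50% at 25 units.
index = therapeutic_index(ld50=500.0, ed50=25.0)
print(index)  # 20.0
```

A larger value indicates a wider margin between the effective and the lethal dose, which is why compositions with large therapeutic indices are preferred.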

The exact dosage will be determined by the practitioner, in light of factors related to the subject
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active
moiety or to maintain the desired effect. Factors which may be taken into account include the severity
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy.
Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending
on the half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.

DIAGNOSTICS

In another embodiment, antibodies which specifically bind TRICH may be used for the
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for diagnostic
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays
for TRICH include methods which utilize the antibody and a label to detect TRICH in human body
fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of
reporter molecules, several of which are described above, are known in the art and may be used.

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are known
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal
or standard values for TRICH expression are established by combining body fluids or cell extracts
taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH under
conditions suitable for complex formation. The amount of standard complex formation may be
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in
subject, control, and disease samples from biopsied tissues are compared with the standard values.
Deviation between standard and subject values establishes the parameters for diagnosing disease.
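
One way such a comparison might be expressed numerically is sketched below. The use of a mean and sample standard deviation to summarize the normal samples, and of a z-score to express deviation, are assumptions of this sketch rather than methods stated in the text; the readings are invented photometric values in arbitrary units.

```python
# Illustrative sketch: compare a subject reading with standard values derived
# from normal samples. The z-score approach and all numbers are assumptions.
from statistics import mean, stdev


def deviation_from_standard(normal_values: list[float], subject_value: float) -> float:
    """Return how many sample standard deviations the subject lies from the normal mean."""
    if len(normal_values) < 2:
        raise ValueError("at least two normal samples are needed")
    spread = stdev(normal_values)
    if spread == 0:
        raise ValueError("normal samples show no variation")
    return (subject_value - mean(normal_values)) / spread


normal_readings = [1.0, 1.2, 0.8, 1.0]  # hypothetical complex-formation signals
z = deviation_from_standard(normal_readings, subject_value=1.5)
print(round(z, 2))
```

Any threshold applied to such a deviation would have to be established empirically for the assay in question.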

In another embodiment of the invention, the polynucleotides encoding TRICH may be used for
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences,
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and
quantify gene expression in biopsied tissues in which expression of TRICH may be correlated with
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of
TRICH, and to monitor regulation of TRICH levels during therapeutic intervention.

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide
sequences, including genomic sequences, encoding TRICH or closely related molecules may be used to
identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is made
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a
conserved motif, and the stringency of the hybridization or amplification will determine whether the
probe identifies only naturally occurring sequences encoding TRICH, allelic variants, or related
sequences.

Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or from
genomic sequences including promoters, enhancers, and introns of the TRICH gene.

Means for producing specific hybridization probes for DNAs encoding TRICH include the
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the
production of mRNA probes. Such vectors are known in the art, are commercially available, and may
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety
of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis,
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis,
normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance,
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g.,
angina, bradyarrhythmia, tachyarrhythmia, hypertension, Long QT syndrome, myocarditis,
cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis,
infectious myositis, polymyositis, neurological disorders associated with transport, e.g., Alzheimer's
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid
psychoses, and schizophrenia, and other disorders associated with transport, e.g., neurofibromatosis,
postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease,
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia,
hypoglycemia, Graves' disease, goiter, Cushing's disease, Addison's disease, glucose-galactose
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes
disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartnup disease, and
Fanconi disease; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke,
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's
disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple
sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral
central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis,
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD),
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses,
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration,
and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis,
Duchenne's muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, central core
disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy,
infectious myositis, polymyositis, dermatomyositis, inclusion body myositis, thyrotoxic myopathy,
ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and
myopathies including encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic
disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease,
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma,
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema,
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis,
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis,

psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjögren's syndrome, systemic
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis,
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder,
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis,
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus.

The polynucleotide sequences encoding TRICH may be used in Southern or northern analysis, dot blot, or other
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like
assays; and in microarrays utilizing fluids or tissues from patients to detect altered TRICH expression.
Such qualitative or quantitative methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays that
detect the presence of associated disorders, particularly those mentioned above. The nucleotide
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue sample
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable
incubation period, the sample is washed and the signal is quantified and compared with a standard
value. If the amount of signal in the patient sample is significantly altered in comparison to a control
sample then the presence of altered levels of nucleotide sequences encoding TRICH in the sample
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the
treatment of an individual patient.
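
The comparison of a patient signal with control values described above can be sketched as follows. This is only an illustration under stated assumptions: the text does not say how "significantly altered" is decided, so the z-score test, the cutoff of 2.0, and the function name is_significantly_altered are all hypothetical choices.

```python
from statistics import mean, stdev


def is_significantly_altered(patient_signal, control_signals, z_cutoff=2.0):
    """Return (z, flag) for a patient hybridization signal.

    z is the number of control standard deviations separating the patient
    signal from the control mean; flag is True when |z| >= z_cutoff.
    The z-score criterion and the cutoff are assumptions, not part of the
    described assay.
    """
    mu = mean(control_signals)
    sigma = stdev(control_signals)  # needs at least two control values
    if sigma == 0:
        raise ValueError("control signals have zero variance")
    z = (patient_signal - mu) / sigma
    return z, abs(z) >= z_cutoff
```

A signal far above the control range is flagged as altered, while a signal inside the control range is not.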

In order to provide a basis for the diagnosis of a disorder associated with expression of TRICH,
a normal or standard profile for expression is established. This may be accomplished by combining
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a
fragment thereof, encoding TRICH, under conditions suitable for hybridization or amplification.
Standard hybridization may be quantified by comparing the values obtained from normal subjects with
values from an experiment in which a known amount of a substantially purified polynucleotide is used.
Standard values obtained in this manner may be compared with values obtained from samples from
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the
presence of a disorder.

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained from
successive assays may be used to show the efficacy of treatment over a period ranging from several
days to months.
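
As a rough sketch of how successive assays might be checked against the normal value, the function below tests whether each repeated measurement lies at least as close to the normal level as the one before it. The name approaches_normal and the strict "never moves away" criterion are assumptions made for illustration; the text does not prescribe a particular rule.

```python
def approaches_normal(assay_values, normal_value):
    """Return True if successive assay values never move away from normal.

    assay_values: measurements in chronological order.
    normal_value: the standard value established from normal subjects.
    """
    deviations = [abs(v - normal_value) for v in assay_values]
    return all(later <= earlier
               for earlier, later in zip(deviations, deviations[1:]))
```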

With respect to cancer, the presence of an abnormal amount of transcript (either under- or
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development
of the disease, or may provide a means for detecting the disease prior to the appearance of actual
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ
preventative measures or aggressive treatment earlier, thereby preventing the development or further
progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TRICH
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically,
or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding
TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding TRICH, and
will be employed under optimized conditions for identification of a specific gene or condition.
Oligomers may also be employed under less stringent conditions for detection or quantification of
closely related DNA or RNA sequences.
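
To make the idea of oligomers taken from a sequence and from its complement concrete, the sketch below derives a naive primer pair from a template. It is a minimal illustration only: the function names, the fixed primer length, and the choice of the template ends are assumptions, and no melting-temperature or specificity checks are attempted.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def primer_pair(template, length=20):
    """Return (forward, reverse) primers taken from the template ends.

    The forward primer is a fragment of the template itself; the reverse
    primer is a fragment of the complementary strand, written 5' to 3'.
    """
    if len(template) < 2 * length:
        raise ValueError("template too short for non-overlapping primers")
    return template[:length], reverse_complement(template[-length:])
```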

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers
derived from the polynucleotide sequences encoding TRICH are used to amplify DNA using the
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary
and tertiary structures of PCR products in single-stranded form, and these differences are detectable
using gel electrophoresis in non-denaturing gels. In fSSCP, the oligonucleotide primers are
fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as
DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual
overlapping DNA fragments which assemble into a common consensus sequence. These computer-based
methods filter out sequence variations due to laboratory preparation of DNA and sequencing
errors using statistical models and automated analyses of DNA sequence chromatograms. In the
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high
throughput MASSARRAY system (Sequenom, Inc., San Diego CA).
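
The in silico comparison of overlapping fragments can be illustrated with a toy example. The sketch assumes the fragments have already been aligned to a common coordinate system and padded with "-" where a fragment gives no coverage (so true deletions are not handled); the minimum-count filter is a crude stand-in for the statistical models mentioned above, and the function name candidate_snps is hypothetical.

```python
from collections import Counter


def candidate_snps(aligned_reads, min_minor_count=2):
    """Report positions where overlapping fragments disagree.

    aligned_reads: equal-length strings, with '-' marking no coverage.
    Returns a list of (position, major_base, minor_base, minor_count).
    A variant seen fewer than min_minor_count times is treated as a
    probable sequencing error and skipped.
    """
    if not aligned_reads:
        return []
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be pre-aligned to the same length")
    snps = []
    for pos in range(length):
        counts = Counter(r[pos] for r in aligned_reads if r[pos] != "-")
        if len(counts) < 2:
            continue  # all covering fragments agree at this position
        (major, _), (minor, minor_n) = counts.most_common(2)
        if minor_n >= min_minor_count:
            snps.append((pos, major, minor, minor_n))
    return snps
```

In the test data used to check this sketch, a variant supported by two fragments is reported, while a variant seen in a single fragment is discarded as a likely error.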

Methods which may also be used to quantify the expression of TRICH include radiolabeling or
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid
quantitation.
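
Interpolating a result from a standard curve can be sketched as below. The piecewise-linear interpolation between flanking standards is an assumption chosen for simplicity (the cited methods may fit a different curve), and interpolate_amount is a hypothetical name.

```python
def interpolate_amount(signal, standards):
    """Estimate an amount from a measured signal using a standard curve.

    standards: list of (known_amount, measured_signal) pairs from a
    dilution series. Linear interpolation is performed between the two
    standards whose signals bracket the measured signal.
    """
    points = sorted(standards, key=lambda p: p[1])
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return (a0 + a1) / 2
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal lies outside the range of the standard curve")
```

Signals outside the measured range raise an error instead of being extrapolated, since extrapolation beyond the dilution series is generally less trustworthy.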

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotide sequences described herein may be used as elements on a microarray. The microarray
can be used in transcript imaging techniques which monitor the relative expression levels of large
numbers of genes simultaneously as described below. The microarray may also be used to identify
genetic variants, mutations, and polymorphisms. This information may be used to determine gene
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor
progression/regression of disease as a function of gene expression, and to develop and monitor the
activities of therapeutic agents in the treatment of disease. In particular, this information may be used
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective
treatment regimen for that patient. For example, therapeutic agents which are highly effective and
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile.

In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH may
be used as elements on a microarray. The microarray may be used to monitor or measure protein-protein
interactions, drug-target interactions, and gene expression profiles, as described above.

A particular embodiment relates to the use of the polynucleotides of the present invention to
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by
quantifying the number of expressed genes and their relative abundance under given conditions and at a
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by
hybridizing the polynucleotides of the present invention or their complements to the totality of
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the
hybridization takes place in high-throughput format, wherein the polynucleotides of the present
invention or their complements comprise a subset of a plurality of elements on a microarray. The
resultant transcript image would provide a profile of gene activity.

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies,
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line.
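
A transcript image as defined above (the number of expressed genes and their relative abundance) can be represented very simply. The sketch below is illustrative only; the input format, a mapping from gene name to a transcript count or signal, and the function name transcript_image are assumptions.

```python
def transcript_image(counts):
    """Summarize expression as the number of expressed genes and their
    relative abundance.

    counts: mapping of gene name -> transcript count (or array signal).
    Genes with a count of zero are treated as not expressed.
    """
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("no transcripts counted")
    expressed = {gene: n for gene, n in counts.items() if n > 0}
    return {
        "expressed_genes": len(expressed),
        "relative_abundance": {g: n / total for g, n in expressed.items()},
    }
```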

Transcript images which profile the expression of the polynucleotides of the present invention
may also be used in conjunction with in vitro model systems and preclinical evaluation of
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000)
Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If a test compound has a
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties.
These fingerprints or signatures are most useful and refined when they contain expression information
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression
provides the highest quality signature. Even genes whose expression is not altered by any tested
compounds are important as well, as the levels of expression of these genes are used to normalize the
rest of the expression data. The normalization procedure is useful for comparison of expression data
after treatment with different compounds. While the assignment of gene function to elements of a
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released
February 29, 2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is
important and desirable in toxicological screening using toxicant signatures to include all expressed
gene sequences.
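
The normalization and statistical matching of signatures described above might look like the following sketch. It is not the method of the cited references: normalizing to the mean of the unaltered genes, expressing a signature as log2 ratios, and scoring similarity by Pearson correlation are all assumptions made for illustration, as are the function names.

```python
import math


def normalize(profile, reference_genes):
    """Scale expression values by the mean level of unaltered genes."""
    ref = sum(profile[g] for g in reference_genes) / len(reference_genes)
    return {gene: value / ref for gene, value in profile.items()}


def signature(treated, untreated, reference_genes):
    """Log2 treated/untreated ratio per gene after normalization."""
    t = normalize(treated, reference_genes)
    u = normalize(untreated, reference_genes)
    return {g: math.log2(t[g] / u[g]) for g in t if g not in reference_genes}


def similarity(sig_a, sig_b):
    """Pearson correlation of two signatures over their shared genes."""
    genes = sorted(set(sig_a) & set(sig_b))
    xs = [sig_a[g] for g in genes]
    ys = [sig_b[g] for g in genes]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("a signature with no variation cannot be matched")
    return cov / math.sqrt(vx * vy)
```

Note that similarity needs no knowledge of gene function, only the paired expression changes, which mirrors the point made in the text.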

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated
biological sample are hybridized with one or more probes specific to the polynucleotides of the
present invention, so that transcript levels corresponding to the polynucleotides of the present
invention may be quantified. The transcript levels in the treated biological sample are compared with
levels in an untreated biological sample. Differences in the transcript levels between the two samples
are indicative of a toxic response caused by the test compound in the treated sample.
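
The treated versus untreated comparison can be sketched as a simple fold-change screen. The log2 threshold of 1.0 (a twofold change) and the pseudocount are hypothetical parameters chosen for illustration; the text does not specify how large a difference must be.

```python
import math


def altered_transcripts(treated, untreated, min_abs_log2=1.0, pseudocount=1.0):
    """Return transcripts whose level differs between the two samples.

    treated, untreated: mappings of transcript name -> measured level.
    A pseudocount is added to both levels to avoid division by zero.
    Returns {name: log2 fold change} for changes of at least min_abs_log2.
    """
    flagged = {}
    for name in treated.keys() & untreated.keys():
        ratio = (treated[name] + pseudocount) / (untreated[name] + pseudocount)
        log_fc = math.log2(ratio)
        if abs(log_fc) >= min_abs_log2:
            flagged[name] = log_fc
    return flagged
```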

Another particular embodiment relates to the use of the polypeptide sequences of the present
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are
analyzed by quantifying the number of expressed proteins and their relative abundance under given
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by
isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is
generally proportional to the level of the protein in the sample. The optical densities of equivalently
positioned protein spots from different samples, for example, from biological samples either treated or
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein
spot density related to the treatment. The proteins in the spots are partially sequenced using, for
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry.
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of
at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In
some cases, further sequence data may be obtained for definitive protein identification.
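
Matching a partial sequence from a gel spot against known polypeptide sequences can be illustrated with an exact substring search. This is a minimal sketch: it assumes an exact contiguous match is required, and the function name identify_spot and the example sequences used to check it are hypothetical, not sequences of the invention.

```python
def identify_spot(partial_sequence, polypeptides, min_len=5):
    """Return the names of polypeptides containing the partial sequence.

    partial_sequence: contiguous amino acid residues from a gel spot.
    polypeptides: mapping of name -> full polypeptide sequence.
    Partial sequences shorter than min_len residues are rejected because
    they are unlikely to identify a protein uniquely.
    """
    peptide = partial_sequence.upper()
    if len(peptide) < min_len:
        raise ValueError("partial sequence is too short to be informative")
    return [name for name, seq in polypeptides.items()
            if peptide in seq.upper()]
```

If more than one name is returned, further sequence data would be needed for a definitive identification, as the text notes.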

A proteomic profile may also be generated using antibodies specific for TRICH to quantify the
levels of TRICH expression. In one embodiment, the antibodies are used as elements on a microarray,
and protein expression levels are quantified by exposing the microarray to the sample and detecting the
levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-111;
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or
amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and should
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J.
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the
analysis of compounds which do not significantly affect the transcript image, but which alter the
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins that are expressed in the treated biological
sample are separated so that the amount of each protein can be quantified. The amount of each protein
is compared to the amount of the corresponding protein in an untreated biological sample. A difference
in the amount of protein between the two samples is indicative of a toxic response to the test compound
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the
individual proteins and comparing these partial sequences to the polypeptides of the present invention.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins from the biological sample are incubated
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with
the amount in an untreated biological sample. A difference in the amount of protein between the two
samples is indicative of a toxic response to the test compound in the treated sample.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g.,
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al.
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155;
and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well
known and thoroughly described in DNA Microarrays: A Practical Approach, M. Schena, ed. (1999)
Oxford University Press, London, hereby expressly incorporated by reference.

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be used
to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be
preferable over coding sequences. For example, conservation of a coding sequence among members
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs),
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat.
Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet.
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, for
example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map
data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 96^ Examples of genetic map
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM)
World Wide Web site. Correlation between the location of the gene encoding TRICH on a physical
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of
DNA associated with that disorder and thus may further positional cloning efforts.

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may
reveal associated markers even if the exact chromosomal locus is not known. This information is
valuable to investigators searching for disease genes using positional cloning or other gene discovery
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized
by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g.,
Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may
also be used to detect differences in the chromosomal location due to translocation, inversion, etc.,
among normal, carrier, or affected individuals.

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between TRICH and the agent being tested may be measured.

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof,
and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can
also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a
solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing
antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH.
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more
antigenic determinants with TRICH.

In additional embodiments, the nucleotide sequences which encode TRICH may be used in any
molecular biology techniques that have yet to be developed, provided the new techniques rely on
properties of nucleotide sequences that are currently known, including, but not limited to, such
properties as the triplet genetic code and specific base pair interactions.

Without further elaboration, it is believed that one skilled in the art can, using the preceding
description, utilize the present invention to its fullest extent. The following embodiments are,
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure
in any way whatsoever.

The disclosures of all patents, applications and publications, mentioned above and below,
including U.S. Ser. No. 60/216,547, U.S. Ser. No. 60/218,232, U.S. Ser. No. 60/220,112, and U.S.
Ser. No. 60/221,839, are expressly incorporated by reference herein.


EXAMPLES

I. Construction of cDNA Libraries

Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD database
(Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were homogenized
and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a
suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol
and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted
with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and
ethanol, or by other routine methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN,
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA
purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g.,
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid
(Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells
including XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or ElectroMAX
DH10B from Life Technologies.

II. Isolation of cDNA Clones

Plasmids obtained as described in Example I were recovered from host cells by in vivo excision
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least
one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of
distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner
(Labsystems Oy, Helsinki, Finland).

III. Sequencing and Analysis

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were
carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI
PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI
protocols and base calling software; or other sequence analysis systems known in the art. Reading
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997,
supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques
disclosed in Example VIII.

The polynucleotide sequences derived from Incyte cDNAs were validated by removing vector,
linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and programs based
on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The Incyte cDNA
sequences or translations thereof were then queried against a selection of public databases such as the
GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS,
DOMO, PRODOM, and hidden Markov model (HMM)-based protein family databases such as PFAM.
(HMM is a probabilistic approach which analyzes consensus primary structures of gene families.
See, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The queries were
performed using programs based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA
sequences were assembled to produce full length polynucleotide sequences. Alternatively, GenBank
cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding
sequences (see Examples IV and V) were used to extend Incyte cDNA assemblages to full length.
Assembly was performed using programs based on Phred, Phrap, and Consed, and cDNA assemblages
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA.

The full length polynucleotide sequences were translated to derive the corresponding full length
polypeptide sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine
residues of the full length translated polypeptide. Full length polypeptide sequences were subsequently
analyzed by querying against databases such as the GenBank protein databases (genpept), SwissProt,
BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and hidden Markov model (HMM)-based protein
family databases such as PFAM. Full length polynucleotide sequences are also analyzed using
MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and
LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments are
generated using default parameters specified by the CLUSTAL algorithm as incorporated into the
MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent
identity between aligned sequences.

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of
Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second
column provides brief descriptions thereof, the third column presents appropriate references, all of
which are incorporated by reference herein in their entirety, and the fourth column presents, where
applicable, the scores, probability values, and other parameters used to evaluate the strength of a match
between two sequences (the higher the score or the lower the probability value, the greater the identity
between two sequences).


The programs described above for the assembly and analysis of full length polynucleotide and
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID
NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and
amplification technologies are described in Table 4, column 4.
IV. Identification and Editing of Coding Sequences from Genomic DNA

Putative transporters and ion channels were initially identified by running the Genscan gene
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is a
general-purpose gene identification program which analyzes genomic DNA sequences from a variety of
organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and S. Karlin
(1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to form an
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan is a
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan-predicted cDNA
sequences encode transporters and ion channels, the encoded polypeptides were analyzed by querying
against PFAM models for transporters and ion channels. Potential transporters and ion channels were
also identified by homology to Incyte cDNA sequences that had been annotated as transporters and ion
channels. These selected Genscan-predicted sequences were then compared by BLAST analysis to the
genpept and gbpri public databases. Where necessary, the Genscan-predicted sequences were then
edited by comparison to the top BLAST hit from genpept to correct errors in the sequence predicted by
Genscan, such as extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or
public cDNA coverage of the Genscan-predicted sequences, thus providing evidence for transcription.
When Incyte cDNA coverage was available, this information was used to correct or confirm the
Genscan-predicted sequence. Full length polynucleotide sequences were obtained by assembling
Genscan-predicted coding sequences with Incyte cDNA sequences and/or public cDNA sequences using
the assembly process described in Example III. Alternatively, full length polynucleotide sequences were
derived entirely from edited or unedited Genscan-predicted coding sequences.
V. Assembly of Genomic Sequence Data with cDNA Sequence Data
"Stitched" Sequences

Partial cDNA sequences were extended with exons predicted by the Genscan gene identification
program described in Example IV. Partial cDNAs assembled as described in Example III were mapped
to genomic DNA and parsed into clusters containing related cDNAs and Genscan exon predictions from
one or more genomic sequences. Each cluster was analyzed using an algorithm based on graph theory
and dynamic programming to integrate cDNA and genomic information, generating possible splice
variants that were subsequently confirmed, edited, or extended to create a full length sequence.
Sequence intervals in which the entire length of the interval was present on more than one sequence in
the cluster were identified, and intervals thus identified were considered to be equivalent by transitivity.
For example, if an interval was present on a cDNA and two genomic sequences, then all three intervals
were considered to be equivalent. This process allows unrelated but consecutive genomic sequences to
be brought together, bridged by cDNA sequence. Intervals thus identified were then "stitched" together
by the stitching algorithm in the order that they appear along their parent sequences to generate the
longest possible sequence, as well as sequence variants. Linkages between intervals which proceed
along one type of parent sequence (cDNA to cDNA or genomic sequence to genomic sequence) were
given preference over linkages which change parent type (cDNA to genomic sequence). The resultant
stitched sequences were translated and compared by BLAST analysis to the genpept and gbpri public
databases. Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit
from genpept. Sequences were further extended with additional cDNA sequences, or by inspection of
genomic DNA, when necessary.
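
The grouping of sequence intervals "by transitivity" described above can be illustrated with a
union-find structure. The following is only a minimal sketch under stated assumptions: the
function name, the string labels for intervals, and the pairwise input format are hypothetical,
and this is not presented as the stitching algorithm actually used.

```python
def group_equivalent_intervals(pairs):
    """Group interval labels into equivalence classes by transitivity.

    `pairs` is an iterable of (label_a, label_b) tuples, each stating that
    the same interval was found on both parent sequences (hypothetical input
    format chosen for illustration).
    """
    parent = {}

    def find(x):
        # Register unseen labels as their own root, then walk to the root
        # with path halving to keep later lookups short.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b  # merge the two classes

    groups = {}
    for label in list(parent):
        groups.setdefault(find(label), set()).add(label)
    return sorted(sorted(g) for g in groups.values())


# An interval seen on one cDNA and two genomic sequences: all three are equivalent.
example = [("cDNA1", "genomicA"), ("cDNA1", "genomicB"), ("cDNA2", "genomicC")]
print(group_equivalent_intervals(example))
```

In this sketch, the interval on genomicA and the interval on genomicB end up in the same group
only because both were matched to cDNA1, which mirrors the bridging of consecutive genomic
sequences by cDNA sequence described in the text.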

"Stretched" Sequences

Partial DNA sequences were extended to full length with an algorithm based on BLAST
analysis. First, partial cDNAs assembled as described in Example III were queried against public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases using
the BLAST program. The nearest GenBank protein homolog was then compared by BLAST analysis
to either Incyte cDNA sequences or GenScan exon predicted sequences described in Example IV. A
chimeric protein was generated by using the resultant high-scoring segment pairs (HSPs) to map the
translated sequences onto the GenBank protein homolog. Insertions or deletions may occur in the
chimeric protein with respect to the original GenBank protein homolog. The GenBank protein homolog,
the chimeric protein, or both were used as probes to search for homologous genomic sequences from the
public human genome databases. Partial DNA sequences were therefore "stretched" or extended by the
addition of homologous genomic sequences. The resultant stretched sequences were examined to
determine whether they contained a complete gene.
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:33-64 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.

Map locations are represented by ranges, or intervals, of human chromosomes. The map
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in
humans, although this can vary widely due to hot and cold spots of recombination.) The cM
distances are based on genetic markers mapped by Généthon which provide boundaries for radiation
hybrid markers whose sequences were included in each of the clusters. Human genome maps and
other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified
disease genes map within or in proximity to the intervals indicated above.
VII. Analysis of Polynucleotide Expression

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel (1995)
supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or related
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer
search can be modified to determine whether any particular match is categorized as exact or similar.
The basis of the search is the product score, which is defined as:

        BLAST Score x Percent Identity
------------------------------------------------
5 x minimum {length(Seq. 1), length(Seq. 2)}

The product score takes into account both the degree of similarity between two sequences and the length
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The
product score represents a balance between fractional overlap and quality in a BLAST alignment. For
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter
of the two sequences being compared. A product score of 70 is produced either by 100% identity and
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap.
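
The product score arithmetic described above can be checked with a short calculation. The
following is a minimal sketch, assuming a single ungapped HSP described only by its counts of
matching and mismatching bases; the function and parameter names are hypothetical and are not
taken from any BLAST implementation.

```python
def blast_score(matches: int, mismatches: int) -> int:
    """Score an HSP with +5 per matching base and -4 per mismatch, as stated in the text."""
    return 5 * matches - 4 * mismatches


def product_score(matches: int, mismatches: int, len_seq1: int, len_seq2: int) -> float:
    """BLAST score x percent identity / (5 x length of the shorter sequence)."""
    aligned = matches + mismatches
    if aligned == 0:
        return 0.0
    percent_identity = 100.0 * matches / aligned
    shorter = min(len_seq1, len_seq2)
    return blast_score(matches, mismatches) * percent_identity / (5 * shorter)


# 100% identity over the full length of the shorter (100-base) sequence.
print(product_score(100, 0, 100, 250))
# 100% identity over 70% of the shorter sequence.
print(product_score(70, 0, 100, 250))
# 88% identity over 100% of the shorter sequence (close to, not exactly, 70).
print(round(product_score(88, 12, 100, 250), 1))
```

Under these assumptions the 88% and 79% identity cases evaluate to roughly 69 and 49, which is
consistent with the approximate values of 70 and 50 quoted in the examples.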

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the
tissue sources from which they were derived. For example, some full length sequences are assembled,
at least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA sequence is
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into one
of the following organ/tissue categories: cardiovascular system; connective tissue; digestive system;
embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells;
hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; respiratory
system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. The number of
libraries in each category is counted and divided by the total number of libraries across all categories.
Similarly, each human tissue is classified into one of the following disease/condition categories: cancer,
cell line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, and other, and the
number of libraries in each category is counted and divided by the total number of libraries across all
categories. The resulting percentages reflect the tissue- and disease-specific expression of cDNA
encoding TRICH. cDNA sequences and cDNA library/tissue information are found in the LIFESEQ
GOLD database (Incyte Genomics, Palo Alto CA).
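
The per-category percentages described above amount to a simple count-and-divide. The following
is a minimal sketch, assuming each library has already been assigned exactly one category label;
the function name and the example labels are hypothetical and do not reflect the LIFESEQ GOLD
database schema.

```python
from collections import Counter


def category_percentages(library_categories):
    """Return the percentage of libraries falling in each category.

    `library_categories` holds one category label per cDNA library.
    """
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}


# Four illustrative libraries, two of them from liver tissue.
libraries = ["liver", "liver", "nervous system", "skin"]
print(category_percentages(libraries))
```

The same function can be applied unchanged to the disease/condition labels, since the text
describes the identical calculation for both classifications.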

VIII. Extension of TRICH Encoding Polynucleotides

Full length polynucleotide sequences were also produced by extension of an appropriate
fragment of the full length molecule using oligonucleotide primers designed from this fragment. One
primer was synthesized to initiate 5' extension of the known fragment, and the other primer was
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target sequence
at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin
structures and primer-primer dimerizations was avoided.
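
The length and GC-content criteria stated above can be expressed as a simple filter. The following
is a minimal sketch, not a reimplementation of OLIGO 4.06: the function names and default
thresholds are hypothetical choices based on the "about 22 to 30 nucleotides" and "about 50% or
more" figures in the text, and annealing temperature, hairpin, and primer-dimer checks are
deliberately left out.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer sequence (case-insensitive)."""
    primer = primer.upper()
    if not primer:
        return 0.0
    return (primer.count("G") + primer.count("C")) / len(primer)


def passes_basic_primer_checks(primer: str, min_len: int = 22, max_len: int = 30,
                               min_gc: float = 0.5) -> bool:
    """Check only the length window and minimum GC content described in the text."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# A 24-mer with 16 G/C bases passes; an AT-only 24-mer and a 4-mer do not.
print(passes_basic_primer_checks("GCGCGCATATGCGCGCATATGCGC"))
print(passes_basic_primer_checks("ATATATATATATATATATATATAT"))
print(passes_basic_primer_checks("GCGC"))
```

A candidate that passes this filter would still need its annealing temperature and secondary
structure evaluated by a dedicated primer design program.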

Selected human cDNA libraries were used to extend the sequence. If more than one extension
was necessary or desired, additional or nested sets of primers were designed.

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C,
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by electrophoresis
on a 1% agarose gel to determine which reactions were successful in extending the sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs,
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing
media, and individual colonies were picked and cultured overnight at 37°C in 384-well plates in LB/2x
carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1:
94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified
using the same conditions as described above. Samples were diluted with 20% dimethylsulfoxide (1:2,
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle
sequencing ready reaction kit (Applied Biosystems).

In like manner, full length polynucleotide sequences are verified using the above procedure or
are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides
designed for such extension, and an appropriate genomic library.

IX. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO:33-64 are employed to screen cDNAs, genomic
DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is
specifically described, essentially the same procedure is used with larger nucleotide fragments.
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of [γ-32P] adenosine
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10^ counts per
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or
Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared.

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra), mechanical
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra). Suggested
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be
produced using available methods and machines well known to those of ordinary skill in the art and may
contain any appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470;
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol.
16:27-31.)

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be
selected using software well known in the art such as LASERGENE software (DNASTAR). The array
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection.
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in
detail below.
Tissue or Cell Sample Preparation

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X first
strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40 µM
dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A)+ RNA with
GEMBRIGHT kits (Incyte). Specific control poly(A)+ RNAs are synthesized by in vitro transcription
from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and
incubated for 20 minutes at 85°C to stop the reaction and degrade the RNA. Samples are purified
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc.
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and
resuspended in 14 µl 5X SSC/0.2% SDS.
Microarray Preparation

Sequences of the present invention are used to generate array elements. Each array element is
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses
primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5
µg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia
Biotech).


Purified array elements are immobilized on polymer-coated glass slides. Glass microscope
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110°C
oven.

Array elements are applied to the coated glass substrate using a procedure described in US
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide.

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene).
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water.
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in
0.2% SDS and distilled water as before.

Hybridization

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered with
a 1.8 cm2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of
140 µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for
about 6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (1X SSC,
0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried.

Detection

Reporter-labeled hybridization complexes are detected with a microscope equipped with an
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and
raster-scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned
with a resolution of 20 micrometers.

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially.
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477,
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source,
although the apparatus is capable of recording the spectra from both fluorophores simultaneously.

The sensitivity of the scans is typically calibrated using the signal intensity generated by a
cDNA control species added to the sample mixture at a known concentration. A specific location on
the array contains a complementary DNA sequence, allowing the intensity of the signal at that
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples
from different sources (e.g., representing test and control cells), each labeled with a different
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two
fluorophores and adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping
emission spectra) between the fluorophores using each fluorophore's emission spectrum.

A grid is superimposed over the fluorescence signal image such that the signal from each spot
is centered in each element of the grid. The fluorescence signal within each element is then integrated
to obtain a numerical value corresponding to the average intensity of the signal. The software used
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte).
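
The grid-based integration described above can be illustrated on a small intensity matrix. The
following is a minimal sketch, assuming a square grid of fixed cell size aligned to the image
origin; the function name and the nested-list image representation are hypothetical, and this is
not presented as the method implemented in the GEMTOOLS program.

```python
def grid_average_intensities(image, cell_size):
    """Average the pixel intensities inside each square grid element.

    `image` is a list of equal-length rows of numeric intensities and
    `cell_size` is the side length of one grid element, in pixels.
    """
    n_rows, n_cols = len(image), len(image[0])
    averages = []
    for r0 in range(0, n_rows, cell_size):
        row_of_cells = []
        for c0 in range(0, n_cols, cell_size):
            # Collect every pixel that falls inside this grid element.
            pixels = [
                image[r][c]
                for r in range(r0, min(r0 + cell_size, n_rows))
                for c in range(c0, min(c0 + cell_size, n_cols))
            ]
            row_of_cells.append(sum(pixels) / len(pixels))
        averages.append(row_of_cells)
    return averages


# A 4 x 4 image divided into four 2 x 2 grid elements.
image = [
    [1, 1, 4, 4],
    [1, 1, 4, 4],
    [0, 2, 8, 8],
    [2, 0, 8, 8],
]
print(grid_average_intensities(image, 2))
```

Each returned value corresponds to one array element, so the output has one row per row of
spots and one column per column of spots.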
XI. Complementary Polynucleotides

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used to
detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of oligonucleotides
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO
4.06 software (National Biosciences) and the coding sequence of TRICH. To inhibit transcription, a
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is
designed to prevent ribosomal binding to the TRICH-encoding transcript.
XII. Expression of TRICH




Expression and purification of TRICH is achieved using bacterial or virus-based expression
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D-
thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting insect
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther.
7:1937-1945.)

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione S-
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia
Biotech). Following purification, the GST moiety can be proteolytically cleaved from TRICH at
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak).
6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra,
ch. 10 and 16). Purified TRICH obtained by these methods can be used directly in the assays shown in
Examples XVI, XVII, and XVIII where applicable.

XIII. Functional Assays

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice
include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen, Carlsbad CA), both of which
contain the cytomegalovirus promoter. 5-10 μg of recombinant vector are transiently transfected into a
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome
formulations or electroporation. 1-2 μg of an additional plasmid containing sequences encoding a
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP;
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics-
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of
fluorescent molecules that diagnose events preceding or coincident with cell death. These events include
changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in
cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down-
regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in
expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies;
and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G.
(1994) Flow Cytometry, Oxford, New York NY.
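
The gating logic described above can be illustrated with a short sketch. It is only an illustration under stated assumptions: the event format, the function name, and both fluorescence thresholds are hypothetical and are not taken from the specification. The sketch selects marker-positive (transfected) events and reports the fraction that also bind fluorescein-conjugated Annexin V.

```python
# Illustrative sketch only: event format and gate values are hypothetical assumptions.

def apoptotic_fraction(events, gfp_gate=1000.0, annexin_gate=500.0):
    """Return the fraction of transfected (GFP-positive) events that are Annexin V positive.

    events: list of dicts with 'gfp' and 'annexin_v' fluorescence intensities.
    Returns None when no event passes the GFP gate.
    """
    transfected = [e for e in events if e["gfp"] >= gfp_gate]
    if not transfected:
        return None
    positive = sum(1 for e in transfected if e["annexin_v"] >= annexin_gate)
    return positive / len(transfected)


if __name__ == "__main__":
    sample = [
        {"gfp": 1500.0, "annexin_v": 800.0},   # transfected, Annexin V positive
        {"gfp": 2000.0, "annexin_v": 100.0},   # transfected, Annexin V negative
        {"gfp": 50.0, "annexin_v": 900.0},     # nontransfected, ignored
        {"gfp": 20.0, "annexin_v": 30.0},      # nontransfected, ignored
    ]
    print(apoptotic_fraction(sample))
```

In practice the gates would be set from nontransfected and unstained control populations rather than fixed constants.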

The influence of TRICH on gene expression can be assessed using highly purified populations
of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. CD64 and
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY).
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression
of mRNA encoding TRICH and other genes of interest can be analyzed by northern analysis or
microarray techniques.

XIV. Production of TRICH Specific Antibodies

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma-
Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to
increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide
and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, blocking with
1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
XV. Purification of Naturally Occurring TRICH Using Specific Antibodies

Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity
chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing TRICH are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such
as urea or thiocyanate ion), and TRICH is collected.
XVI. Identification of Molecules Which Interact with TRICH

Molecules which interact with TRICH may include transporter substrates, agonists or
antagonists, modulatory proteins such as Gβγ proteins (Reimann, supra) or proteins involved in TRICH
localization or clustering such as MAGUKs (Craven, supra). TRICH, or biologically active fragments
thereof, are labeled with ¹²⁵I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. Hunter (1973)
Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a multi-well plate
are incubated with the labeled TRICH, washed, and any wells with labeled TRICH complex are
assayed. Data obtained using different concentrations of TRICH are used to calculate values for the
number, affinity, and association of TRICH with the candidate molecules.
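
As an illustration of how binding-site number and affinity might be calculated from such data, the sketch below applies a Scatchard analysis to a one-site binding model. The specification does not name a particular analysis, so the choice of model, the function names, and the synthetic data are all assumptions made for illustration.

```python
# Illustrative sketch: Scatchard analysis under an assumed one-site binding model.
# For B = Bmax * F / (Kd + F), a plot of B/F against B is a line with
# slope -1/Kd and intercept Bmax/Kd.

def linear_fit(xs, ys):
    """Ordinary least-squares fit returning (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def one_site_bound(free, kd, bmax):
    """Bound ligand predicted by the one-site model at a given free concentration."""
    return bmax * free / (kd + free)


def scatchard(free, bound):
    """Estimate (Kd, Bmax) from paired free and bound labeled-TRICH measurements."""
    ratios = [b / f for b, f in zip(bound, free)]
    slope, intercept = linear_fit(bound, ratios)
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


if __name__ == "__main__":
    # Synthetic, noise-free data generated from hypothetical Kd = 2 nM, Bmax = 100 units.
    free_nm = [0.5, 1.0, 2.0, 5.0, 10.0, 20.0]
    bound = [one_site_bound(f, kd=2.0, bmax=100.0) for f in free_nm]
    kd, bmax = scatchard(free_nm, bound)
    print(f"Kd = {kd:.2f} nM, Bmax = {bmax:.1f} units")
```

With real, noisy counts a nonlinear fit of the saturation curve is often preferred over the linearized form; the linear version is used here only because it is short and self-contained.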

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion protein
and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are
commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins are
discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122).
TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT)
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent
No. 6,057,101).

Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH
ion channel activity using the assays described in section XVIII.

XVII. Demonstration of TRICH Activity

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion
conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are
commercially available, and the techniques to introduce them into cells are well known to those
skilled in the art. A second plasmid which expresses any one of a number of marker genes, such as β-
galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after transformation
under conditions appropriate for the cell line to allow expression and accumulation of TRICH and β-
galactosidase.

Transformed cells expressing β-galactosidase are stained blue when a suitable colorimetric
substrate is added to the culture media under conditions that are well known in the art. Stained cells
are tested for differences in membrane conductance by electrophysiological techniques that are well
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone or
β-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to
conductance can be confirmed by incubating the cells using antibodies specific for TRICH. The
antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel,
and the associated conductance.

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH-
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et
al., supra; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into
mature stage IV oocytes. Injected oocytes are incubated at 18°C for 1-5 days. Inside-out
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KCl,
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations
of the TRICH mediator, such as cAMP, cGMP, or Ca²⁺ (in the form of CaCl₂), where appropriate.
Electrode resistance is set at 2-5 MΩ, and electrodes are filled with the intracellular solution lacking
mediator. Experiments are performed at room temperature from a holding potential of 0 mV. Voltage
ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current measured
is proportional to the activity of TRICH in the assay.
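
The ramp protocol above implies 2.5 s × 500 Hz = 1250 samples per sweep. The following sketch builds the command-voltage ramp and estimates a slope conductance from a current-voltage record. The function names and the synthetic ohmic current used in the demonstration are assumptions for illustration; the specification does not prescribe this analysis.

```python
# Illustrative sketch: command ramp and slope conductance for a voltage-ramp recording.

def ramp_voltages(v_start_mv=-100.0, v_end_mv=100.0, duration_s=2.5, rate_hz=500.0):
    """Return the command voltage (mV) at each sample of a linear ramp."""
    n = int(round(duration_s * rate_hz))  # 2.5 s at 500 Hz gives 1250 samples
    step = (v_end_mv - v_start_mv) / (n - 1)
    return [v_start_mv + i * step for i in range(n)]


def slope_conductance_ns(voltages_mv, currents_pa):
    """Least-squares slope of current (pA) against voltage (mV); pA/mV equals nS."""
    n = len(voltages_mv)
    mean_v = sum(voltages_mv) / n
    mean_i = sum(currents_pa) / n
    svv = sum((v - mean_v) ** 2 for v in voltages_mv)
    svi = sum((v - mean_v) * (i - mean_i) for v, i in zip(voltages_mv, currents_pa))
    return svi / svv


if __name__ == "__main__":
    volts = ramp_voltages()
    # Hypothetical ohmic current: 2 nS conductance reversing at -10 mV.
    amps = [2.0 * (v - (-10.0)) for v in volts]
    print(len(volts), round(slope_conductance_ns(volts, amps), 3))
```

Comparing the slope obtained with and without mediator, or between TRICH-expressing and control patches, gives one simple readout of the channel's contribution to the measured current.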

In particular, the activities of TRICH-1, TRICH-2, and TRICH-10 are measured as
conductance, the activities of TRICH-6 and TRICH-9 are measured as conductance in the presence
of membrane stretch or free fatty acids, the activities of TRICH-18, TRICH-25 and TRICH-31 are
measured as voltage-gated K⁺ conductance, TRICH-5 activity is measured as Cl⁻ conductance in the
presence of GABA, TRICH-11 activity is measured as cation conductance in the presence of heat, and
the activity of TRICH-9, TRICH-28 is measured as Ca²⁺ conductance.

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5 mM NaCl, 2.5 mM KCl, 1 mM CaCl₂,
1 mM MgCl₂, 1 mM Na₂HPO₄, 5 mM Hepes, 3.8 mM NaOH, 50 μg/ml gentamycin, pH 7.8) to allow
expression of TRICH. Oocytes are then transferred to standard uptake medium (100 mM NaCl, 2 mM
KCl, 1 mM CaCl₂, 1 mM MgCl₂, 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g.,
amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g.,
radiolabeled with ³H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30
minutes, uptake is terminated by washing the oocytes three times in Na⁺-free medium, measuring the
incorporated label, and comparing with controls. TRICH activity is proportional to the level of
internalized labeled substrate. In particular, test substrates include pigment precursors and related
molecules for TRICH-3, aminophospholipids for TRICH-4, fructose and glucose for TRICH-7 and
TRICH-15, amino acids for TRICH-8, Na⁺ and iodide for TRICH-12, Na⁺ and H⁺ for TRICH-13 and
TRICH-21, Na⁺ and glucose for TRICH-16 and TRICH-19, and glucose for TRICH-23, TRICH-26,
TRICH-29, TRICH-30, and TRICH-32.
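
The comparison with controls in the uptake assay can be sketched as follows. The function names, the use of uninjected or water-injected oocytes as the control group, and the example counts are assumptions for illustration only.

```python
# Illustrative sketch: TRICH-dependent uptake relative to control oocytes.
from statistics import mean


def specific_uptake_per_min(injected_counts, control_counts, minutes=30.0):
    """Mean label in TRICH-injected oocytes minus mean label in controls, per minute."""
    return (mean(injected_counts) - mean(control_counts)) / minutes


def fold_over_control(injected_counts, control_counts):
    """Ratio of mean label in TRICH-injected oocytes to mean label in controls."""
    return mean(injected_counts) / mean(control_counts)


if __name__ == "__main__":
    # Hypothetical counts per oocyte after a 30 minute incubation.
    injected = [3000, 3300, 2700]
    control = [300, 330, 270]
    print(specific_uptake_per_min(injected, control))
    print(fold_over_control(injected, control))
```

Subtracting the control mean removes label taken up by endogenous oocyte transporters or bound nonspecifically, so the remainder can be attributed to the expressed protein.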

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled ATP-
[γ-³²P], separation of the hydrolysis products by chromatographic methods, and quantitation of the
recovered ³²P using a scintillation counter. The reaction mixture contains ATP-[γ-³²P] and varying
amounts of TRICH in a suitable buffer incubated at 37°C for a suitable period of time. The reaction is
terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an aliquot
of the reaction mixture is subjected to membrane or filter paper-based chromatography to separate the
reaction products. The amount of ³²P liberated is counted in a scintillation counter. The amount of
radioactivity recovered is proportional to the ATPase activity of TRICH in the assay.
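
One way the scintillation counts might be converted into an activity value is sketched below. The blank subtraction, the specific-activity conversion, and every parameter name and number are assumptions for illustration; the specification states only that recovered radioactivity is proportional to ATPase activity.

```python
# Illustrative sketch: converting counted 32P into released phosphate and a rate.

def phosphate_released_pmol(sample_cpm, blank_cpm, cpm_per_pmol, aliquot_fraction):
    """Phosphate released in the whole reaction, from counts in one aliquot.

    blank_cpm is a no-enzyme control; cpm_per_pmol is the specific activity of the
    labeled ATP; aliquot_fraction is the share of the reaction that was counted.
    """
    net_cpm = sample_cpm - blank_cpm
    return net_cpm / cpm_per_pmol / aliquot_fraction


def atpase_rate(pmol_released, minutes, protein_ug):
    """Activity in pmol phosphate per minute per microgram of protein."""
    return pmol_released / minutes / protein_ug


if __name__ == "__main__":
    released = phosphate_released_pmol(5500.0, 500.0, 100.0, 0.1)
    print(round(released, 3), round(atpase_rate(released, 10.0, 5.0), 3))
```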
XVIII. Identification of TRICH Agonists and Antagonists

TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK
(Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the
presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using patch
clamp methods well known in the art or as described in Example XVII. Alternatively, ion channel
activity is assayed using fluorescent techniques that measure ion flux across the cell membrane
(Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Molloy (1996) Anal.
Biochem. 241:51-58). These assays may be adapted for high-throughput screening using microplates.
Changes in internal ion concentration are measured using fluorescent dyes such as the Ca²⁺ indicator
Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the Cl⁻ indicator MQAE (all
available from Molecular Probes) in combination with the FLIPR fluorimetric plate reading system
(Molecular Devices). In a more generic version of this assay, changes in membrane potential caused by
ionic flux across the plasma membrane are measured using oxonol dyes such as DiBAC4 (Molecular
Probes). DiBAC4 equilibrates between the extracellular solution and cellular sites according to the
cellular membrane potential. The dye's fluorescence intensity is 10-fold greater when bound to
hydrophobic intracellular sites, allowing detection of DiBAC4 entry into the cell (Gonzalez, J.E. and
P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). Candidate agonists or antagonists may be
selected from known ion channel agonists or antagonists, peptide libraries, or combinatorial chemical
libraries.
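
For the fluorescence-based screen, a per-well response is commonly expressed as the peak change in fluorescence relative to baseline. The sketch below computes that value and labels a candidate by comparing wells with and without the compound. The baseline window, the 20% threshold, and the labels are hypothetical choices, not values from the specification.

```python
# Illustrative sketch: per-well fluorescence response and a simple hit call.

def delta_f_over_f0(trace, baseline_points=5):
    """Peak fluorescence change relative to the mean of the first baseline points."""
    f0 = sum(trace[:baseline_points]) / baseline_points
    return (max(trace) - f0) / f0


def classify(response_with, response_without, threshold=0.2):
    """Label a compound by the fractional change it causes in the well response."""
    change = (response_with - response_without) / response_without
    if change >= threshold:
        return "candidate agonist"
    if change <= -threshold:
        return "candidate antagonist"
    return "no effect"


if __name__ == "__main__":
    # Hypothetical plate-reader traces (arbitrary fluorescence units).
    untreated = [100, 100, 100, 100, 100, 120, 140, 130]
    treated = [100, 100, 100, 100, 100, 150, 180, 160]
    r0 = delta_f_over_f0(untreated)
    r1 = delta_f_over_f0(treated)
    print(r0, r1, classify(r1, r0))
```

For an indicator whose signal falls on activation (for example a quenched Cl⁻ dye), the peak would be replaced by the minimum and the sign of the comparison reversed.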

Various modifications and variations of the described methods and systems of the invention will
be apparent to those skilled in the art without departing from the scope and spirit of the invention.
Although the invention has been described in connection with certain embodiments, it should be
understood that the invention as claimed should not be unduly limited to such specific embodiments.
Indeed, various modifications of the described modes for carrying out the invention which are obvious
to those skilled in molecular biology or related fields are intended to be within the scope of the following
claims.

What is claimed is:

1. An isolated polypeptide selected from the group consisting of:

a) a polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32,

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical
to an amino acid sequence selected from the group consisting of SEQ ID NO:1-32,

c) a biologically active fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, and

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32.

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:1-32.

3. An isolated polynucleotide encoding a polypeptide of claim 1.

4. An isolated polynucleotide encoding a polypeptide of claim 2.

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID
NO:33-64.

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a
polynucleotide of claim 3.

7. A cell transformed with a recombinant polynucleotide of claim 6.

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim 1, the method comprising:

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim
1, and

b) recovering the polypeptide so expressed.

10. An isolated antibody which specifically binds to a polypeptide of claim 1.

11. An isolated polynucleotide selected from the group consisting of:

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting
of SEQ ID NO:33-64,

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64,

c) a polynucleotide complementary to a polynucleotide of a),

d) a polynucleotide complementary to a polynucleotide of b), and

e) an RNA equivalent of a)-d).

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a
polynucleotide of claim 11.

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 11, the method comprising:

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides
comprising a sequence complementary to said target polynucleotide in the sample, and which probe
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization
complex is formed between said probe and said target polynucleotide or fragments thereof, and

b) detecting the presence or absence of said hybridization complex, and, optionally, if
present, the amount thereof.

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides.


15. A method for detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 11, the method comprising:

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction
amplification, and

b) detecting the presence or absence of said amplified target polynucleotide or fragment
thereof, and, optionally, if present, the amount thereof.

16. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable
excipient.

17. A composition of claim 16, wherein the polypeptide has an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32.

18. A method for treating a disease or condition associated with decreased expression of
functional TRICH, comprising administering to a patient in need of such treatment the composition of
claim 16.

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of
claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound; and

b) detecting agonist activity in the sample.

20. A composition comprising an agonist compound identified by a method of claim 19 and
a pharmaceutically acceptable excipient.

21. A method for treating a disease or condition associated with decreased expression of
functional TRICH, comprising administering to a patient in need of such treatment a composition of
claim 20.

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide
of claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting antagonist activity in the sample.

23. A composition comprising an antagonist compound identified by a method of claim 22
and a pharmaceutically acceptable excipient.

24. A method for treating a disease or condition associated with overexpression of functional
TRICH, comprising administering to a patient in need of such treatment a composition of claim 23.


25. A method of screening for a compound that specifically binds to the polypeptide of claim
1, said method comprising the steps of:

a) combining the polypeptide of claim 1 with at least one test compound under suitable
conditions, and

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a
compound that specifically binds to the polypeptide of claim 1.

26. A method of screening for a compound that modulates the activity of the polypeptide of
claim 1, said method comprising:

a) combining the polypeptide of claim 1 with at least one test compound under conditions
permissive for the activity of the polypeptide of claim 1,

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, and

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change in
the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a
compound that modulates the activity of the polypeptide of claim 1.

27. A method for screening a compound for effectiveness in altering expression of a target
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method
comprising:

a) exposing a sample comprising the target polynucleotide to a compound, under conditions
suitable for the expression of the target polynucleotide,

b) detecting altered expression of the target polynucleotide, and

c) comparing the expression of the target polynucleotide in the presence of varying amounts of
the compound and in the absence of the compound.

28. A method for assessing toxicity of a test compound, said method comprising:

a) treating a biological sample containing nucleic acids with the test compound;

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at
least 20 contiguous nucleotides of a polynucleotide of claim 11 under conditions whereby a specific
hybridization complex is formed between said probe and a target polynucleotide in the biological
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim
11 or fragment thereof;

c) quantifying the amount of hybridization complex; and

d) comparing the amount of hybridization complex in the treated biological sample with the
amount of hybridization complex in an untreated biological sample, wherein a difference in the
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test
compound.

29. A diagnostic test for a condition or disease associated with the expression of TRICH in a
biological sample comprising the steps of:

a) combining the biological sample with an antibody of claim 10, under conditions suitable
for the antibody to bind the polypeptide and form an antibody:polypeptide complex; and

b) detecting the complex, wherein the presence of the complex correlates with the presence
of the polypeptide in the biological sample.

30. The antibody of claim 10, wherein the antibody is:

a) a chimeric antibody,

b) a single chain antibody,

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

31. A composition comprising an antibody of claim 10 and an acceptable excipient.

32. A method of diagnosing a condition or disease associated with the expression of TRICH
in a subject, comprising administering to said subject an effective amount of the composition of claim
31.

33. A composition of claim 31, wherein the antibody is labeled.

34. A method of diagnosing a condition or disease associated with the expression of TRICH
in a subject, comprising administering to said subject an effective amount of the composition of claim
33.

35. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
10 comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32, or an immunogenic fragment thereof, under conditions to
elicit an antibody response;

b) isolating antibodies from said animal; and

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal
antibody which binds specifically to a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-32.

36. An antibody produced by a method of claim 35.

37. A composition comprising the antibody of claim 36 and a suitable carrier.

38. A method of making a monoclonal antibody with the specificity of the antibody of claim
10 comprising:

a) immunizing an animal with a polypeptide having an amino acid sequence selected from
the group consisting of SEQ ID NO:1-32, or an immunogenic fragment thereof, under conditions to
elicit an antibody response;

b) isolating antibody producing cells from the animal;

c) fusing the antibody producing cells with immortalized cells to form monoclonal antibody-
producing hybridoma cells;

d) culturing the hybridoma cells; and

e) isolating from the culture monoclonal antibody which binds specifically to a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32.


39. A monoclonal antibody produced by a method of claim 38.

40. A composition comprising the antibody of claim 39 and a suitable carrier.

41. The antibody of claim 10, wherein the antibody is produced by screening a Fab
expression library.

42. The antibody of claim 10, wherein the antibody is produced by screening a recombinant
immunoglobulin library.

43. A method for detecting a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32 in a sample, comprising the steps of: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific
binding of the antibody and the polypeptide; and

b) detecting specific binding, wherein specific binding indicates the presence of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32 in
the sample.

44. A method of purifying a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-32 from a sample, the method comprising:

a) incubating the antibody of claim 10 with a sample under conditions to allow specific
binding of the antibody and the polypeptide; and

b) separating the antibody from the sample and obtaining the purified polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-32.

45. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:1.

46. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2.

47. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3.

48. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4.

49. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5.

50. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6.

51. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7.

52. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8.

53. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9.

54. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10.

55. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11.

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12.

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13.

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14.

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15.

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16.

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17.

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18.

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19.

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20.

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21.

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22.

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23.

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24.

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25.

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26.

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27.

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28.

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29.

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30.

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31.

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32.

77. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:33.

78. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:34.

79. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:35.

80. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:36.

81. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:37.

82. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:38.

83. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:39.

84. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:40.

85. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:41.

86. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:42.

87. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:43.

88. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:44.

89. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:45.

90. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:46.

91. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:47.

92. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:48.

93. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:49.

94. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:50.

95. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:51.

96. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:52.

97. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:53.

98. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:54.

99. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:55.

100. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:56.

101. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:57.

102. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:58.

103. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:59.

104. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:60.

105. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:61.

106. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:62.

107. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:63.

108. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:64.

<140> To Be Assigned 
<141> Herewith 

<150> 60/216,547; 60/218,232; 60/220,112; 60/221,839
<151> 2000-07-07; 2000-07-14; 2000-07-21; 2000-07-2a

<160> 64 

<170> PERL Program 

<210> 1

<211> 332 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3474673CD1 

<400> 1 

Met Tyr Arg Pro Arg Ala Arg Ala Ala Pro Glu Gly Arg Val Arg

1 5 10 15

Gly Cys Ala Val Pro Ser Thr Val Leu Leu Leu Leu Ala Tyr Leu

20 25 30

Ala Tyr Leu Ala Leu Gly Thr Gly Val Phe Trp Thr Leu Glu Gly

35 40 45

Arg Ala Ala Gln Asp Ser Ser Arg Ser Phe Gln Arg Asp Lys Trp
50 55 60

Glu Leu Leu Gln Asn Phe Thr Cys Leu Asp Arg Pro Ala Leu Asp

65 70 75

Ser Leu Ile Arg Asp Val Val Gln Ala Tyr Lys Asn Gly Ala Ser
80 85 90

Leu Leu Ser Asn Thr Thr Ser Met Gly Arg Trp Glu Leu Val Gly
95 100 105

Ser Phe Phe Phe Ser Val Ser Thr Ile Thr Thr Ile Gly Tyr Gly

110 115 120

Asn Leu Ser Pro Asn Thr Met Ala Ala Arg Leu Phe Cys Ile Phe

125 130 135

Phe Ala Leu Val Gly Ile Pro Leu Asn Leu Val Val Leu Asn Arg

140 145 150

Leu Gly His Leu Met Gln Glp Gly Val Asn His Trp Ala Ser Arg

155 160 165

Leu Gly Gly Thr Trp Gln Asp Pro Asp Lys Ala Arg Trp Leu Ala

170 175 180

Gly Ser Gly Ala Leu Leu Ser Gly Leu Leu Leu Phe Leu Leu Leu

185 190 195

Pro Pro Leu Leu Phe Ser His Met Glu Gly Trp Ser Tyr Thr Glu
200 205 210

Gly Phe Tyr Phe Ala Phe Ile Thr Leu Ser Thr Val Gly Phe Gly

215 220 225

Asp Tyr Val Ile Gly Met Asn Pro Ser Gln Arg Tyr Pro Leu Trp

230 235 240

Tyr Lys Asn Met Val Ser Leu Trp Ile Leu Phe Gly Met Ala Trp

245 250 255

Leu Ala Leu Ile Ile Lys Leu Ile Leu Ser Gln Leu Glu Thr Pro
260 265 270

Gly Arg Val Cys Ser Cys Cys His His Ser Ser Lys Glu Asp Phe

275 280 285

Lys Ser Gln Ser Trp Arg Gln Gly Pro Asp Arg Glu Pro Glu Ser

290 295 300

His Ser Pro Gln Gln Gly Cys Tyr Pro Glu Gly Pro Met Gly Ile

305 310 315

Ile Gln His Leu Glu Pro Ser Ala His Ala Ala Gly Cys Gly Lys

320 325 330

Asp Ser

<210> 2 
<211> 226 

<212> PRT

<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 4588877CD1

<400> 2 

Met Val Glu Met Gly Trp Asp Trp Ala Asp Arg Lys Asp Met Arg
1 5 10 15

His Arg Leu Gln Ala Gly Asn Leu Glu Asn Thr Asp Gln Val Lys

20 25 30

Ser Pro Leu Leu Thr Gly Asp Ser Ser Gly Leu Pro Pro Ala Pro

35 40 45

Ser Ala Pro Thr His Gly Val Lys Ala Ser Gly Gly Leu Gly Thr

50 55 60

Ile Leu His Pro Gln Asp Pro Asp Lys Ala Arg Trp Leu Ala Gly

65 70 75

Ser Gly Ala Leu Leu Ser Gly Leu Leu Leu Phe Leu Leu Leu Pro

80 85 90

Pro Leu Leu Phe Ser His Met Glu Gly Trp Ser Tyr Thr Glu Gly

95 100 105

Phe Tyr Phe Ala Phe Ile Thr Leu Ser Thr Val Gly Phe Gly Asp
110 115 120

Tyr Val Ile Gly Met Asn Pro Ser Gln Arg Tyr Pro Leu Trp Tyr
125



Lys 


Asn 


Met 


Val 


Ser 


Leu 


Trp 


lie 


Leu 










140 










Ala 


Leu 


lie 


lie 


Lys 


Leu 


lie 


Leu 


Ser 










■ice 
155 










Arg 




Cys 


Ser 


Cys 

1 T A 


cys 


His 


His 


Ser 


Ser 


Gin 


Ser 


Trp 


X tv 

Arg 


Gin 


Gly 


Pro 


Asp 










185 








Ser 


Pro 


Gin 


Gin 


Gly 


Cys 


Tyr 


Pro 


Glu 










200 










Gin 


His 


Leu 


Glu 


Pro 


Ser 


Ala 


His* 


Ala 



215 , 

Ser 



<210> 3
<211> 646
<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 7472214CD1

<400> 3 



Met 
1 


Ala 


Glu 


Lys 


Ala 
5 


Leu 


Glu 


Ala Val 


Gly 


Ala 


Val 


Ala 


Met 
20 


Ala 


Val 


Thr Leu 


Pro 


Val 


Leu 


Thr 


Thr 


His 


Leu 


Lys Lys 










35 






Glu 


Ala 


Gin 


Arg 


Phe 


Ser 


His 


Leu Pro 










50 








lie 


Glu 


Phe 


Val 


Glu 


Leu 


Ser 


Tyr Ser 










65 






Trp 


Arg 


Lys 


Arg 


Gly 


Tyx 


Lys 


Thr Leu 










80 








Lys 


Phe 


Cys 


Arg 


Arg 


Glu 


Leu 


He Gly 










95 






Ala 


Gly 


Lys 


Ser 


Thr 


Phe 


Met 


Asn He 










110 








Ser 


Gly 


Met 


Lys 


Gly 


Gin 


lie 


Leu Val 










125 








Leu 


Arg 


Thr 


Phe 


Arg 


Lys 


Met 


Ser -Cys 










140 




Met 


Leu 


Leu 


Pro 


His 


Leu 


Thr 


Val Leu 










155 








Ala 


Asn 


Leii 


Asn 


Leu 


Thr 


Glu 


Asn Pro 










170 








Val 


Thr 


Glu 


Tie 


Leu 


Thr 


Ala 


Leu Gly 










185 






Thr 


Arg 


Thr 


Ala 


Leu 


Leu 


Ser 


Gly Gly 










200 






lie 


Ala 


Leu 


Glu 


Leu 


Val 


Asn 


Asn Pro 










.215 








Glu 


Pro 


Thr 


Ser 


Gly 


Leu 


Asp 


Ser Ala 










230 








Ser 


L'eu 


Met 


Lys 


Ser 


Leu 


Ala 


Gin Gly 










245 






Thr 


He 


His 


Gin 


Pro 


Ser 


Ala 


Lys Leu 










260 






Leu 


Tyr 


lie 


Leu 


Ser 


Gin 


Gly 


Gin Cys 










275 






Thr. 


Asn 


Leu 


He 


Pro 


Tyr 


Leu 


Lys Gly 










290 






Thr 


Tyr 


His 


Asn 


Pro 


Ala 


Asp 


Phe Val 




130 135 



Phe 




Met 


Ala 


Trp 


Leu 


145 








150 


Gin 


Leu 


Glu 


•Thr 


Pro 


Gly 


160 






» 




165 


Ser 


Lys 


Glu 


Asp 


Phe 


Lys 


1/5 










180 


Arg 


Glu 


Pro 


Glu 


Ser 


His 


190 










195 


Gly 


Pro 


Met 


Gly 


He 


He 


205 










210 


Ala 


Gly 


Cys 


Gly 


i^ys 


Asp 


220 








• 


225 


Gly 


,Cys 


Gly 


Leu 


Gly 


Pro 


10 










15 


Glu 


Asp 


Gly 


Ala 


Glu 


Pro 


25 










30 


Val 


Glu 


Asn 


His 


lie 


Thr 


40 










45 


Lys 


Arg 


Ser 


Ala 


Val 


Asp 


55 










. 60 


Val 


Arg 


Glu 


Gly 


Pro 


Cys 


70 








75 


Leu 


Lys 


Cys 


Leu 


Ser 


Gly 


85 










90 


He 


Met 


Gly 


Pro 


Ser 


Gly 


100 










105 


Leu 


Ala 


Gly 


Tyr 


Arg 


Glu 


115 










120 


Asn 


Gly 


Arg 


Pro 


Arg 


Glu 


13 0 










135 


Tyr 


He 


Met 


Gin 


Asp 


Asp 


145 










150 


Glu 


Ala 


M^t 


Met. 


Val 


Ser 


160 










165 


Asp 


Val 


Lys 


Asn 


Asp 


Leii 


175 


• 








180 


Leu 


Met 


Ser 


Cys 


Ser 


His 


190 








195 


Gin 


Arg 


Lys 


Arg 


Leu 


Ala 


205 










210 


Pro 


Val 


Met 


Phe 


Phe 


Asp 


220 










225 


Ser 


Cys 


Phe 


Gin 


Val 


Val 


235 










240 


Gly 


Arg 


Thr 


He 


He 


Cys 


250 










255 


Phe 


Glu 


Met 


Phe 


Asp 


Lys 












2/0 


He 


Phe 


Lys 


Gly 


Val 


Val 


280 










285 


Leu 


Gly 


Leu 


liis 


Cys 


Pro 


295 










300 


He 


Glu 


Val 


Ala 


Ser 


Gly 
305 310 315

Glu Tyr Gly Asp Leu Asia Pro Met Leu Phe Arg Ala Val Gln Asn

320 325 330

Gly Leu Cys Ala Met Ala Glu Lys Lys Ser Ser Pro Glu Lys Asn

335 340 345

Glu Val Pro Ala Pro Cys Pro Pro Cys Pro Pro Glu Val Asp Pro
350 355 360

Ile Glu Ser His Thr Phe Ala Thr Ser Thr Leu Thr Gln Phe Cys
365 370 375

Ile Leu Phe Lys Arg Thr Phe Leu Ser Ile Leu Arg Asp Thr Val
380 385 390

Leu Thr His Leu Arg Phe Met Ser His Val Val Ile Gly Val Leu
395 400 405

Ile Gly Leu Leu Tyr Leu His Ile Gly Asp Asp Ala Ser Lys Val
410 415 420

Phe Asn Asn Thr Gly Cys Leu Phe Phe Ser Met Leu Phe Leu Met
425 430 435

Phe Ala Ala Leu Met Pro Thr Val Leu Thr Val Pro Leu Glu Met
440 445 450

Ala Val Phe Met Arg Glu His Leu Asn Tyr Trp Tyr Ser Leu Lys
455 460 465

Ala Tyr Tyr Leu Ala Lys Thr Met Ala Asp Val Pro Phe Gln Val
470 475 480

Val Cys Pro Val Val Tyr Cys Ser Ile Val Tyr Trp Met Thr Gly
485 490 495

Gln Pro Ala Glu Thr Ser Arg Phe Leu Leu Phe Ser Ala Leu Ala
500 505 510



Thr 


Ala 


Thr 


Ala 


Leu 


Val Ala Gin Ser Leu Gly Leu Leu' 


He Gly 










515 


520 


525 


Ala 


Ala 


Ser 


Asn 


Ser Leu Gin Val Ala Thr Phe Val Gly »Pro Val 










53 0 


• 535 


540 


Thr 


Ala 


He 


Pro Val 


Leu Leu Phe Ser Gly Phe Phe Val 


Ser Phe 










545 


550 


555 


Lys 


Thr 


He 


Pro Thr 


Tyr Leu Gin Trp Ser Ser Tyr Leu 


Ser Tyr 








560 


565 


57 0 


Val 


Arg 


Tyr 


Gly 


Phe 


Glu Gly Val He Leu Thr He Tyr 


Gly Met 




575 


5B0 


585 


Glu Arg Gly Asp 


LeU" Thr Cys Leu Glu Glu Arg Cys Pro 


Phe Arg 










590 


595 


600 


Glu 


Pro 


Gin 


Ser 


He 


Leu Arg Ala Leu Asp Val Glu Asp Ala Lys 










605 


610 


615 


Leu 


Tyr 


Met 


Asp 


Phe 


Leu Val Leu Gly He Phe Phe Leu 


Ala Leu 






620 


625 


630 


Arg 


Leu 


Leu 


Ala 


Tyr 


Leu Val Leu Arg Tyr Arg Val Lys 


Ser Glu 








635 


640 


645 



Arg 
<400> 4



Met 


Ala 


Val 


Cys 


Ala 


Lys 


Lys 


Arg 


Pro 


Pro 


Glu Glu Glu Arg Arg 


1 






5 










10 










15 


Ala 


Arg 


Ala 


Asn 


Asp 


Arg 


Glu 


Tyr Asn 


Glu 


Lys 


Phe 


Gin 


Tyr 


Ala 








20 










25 










30 


Ser Asn 


Cys 


He 


Lys 


Thr 


Ser 


Lys 


Tyr 


Asn 


He 


Leu 


Thr 


Phe 


Leu 








35 










40 










45 


Pro 


Val 


Asn 


Leu 


Phe 


Glu 


Gin 


Phe 


Gin 


Glu 


Val 


Ala 


Asn 


Thr 


Tyr 






Phe' 




50 










55 










60 


Phe 


Leu 


Leu 


Leu 


He 


Leu 


Gin 


Leu 


He 


Pro 


Gin 


He 


Ser 


Ser 
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65 






Leu 


Ser Trp 


Phe 


Thr 


Thr 


He Val 


•II 






80 






lie 


Thr Ala 


Val 


Lys 


Asp 


Ala Thr 








95 




Ser 


Asp Asn Gin 


Val 


Asn 


Asn Arg 




110 




. Gly 


He Leu Gin 


Gin 


Glu 


Gin Trp 








125 




lie 


He Lys 


Leu 


Glu 
140 


Asn 


Asn Gin 


Leu 


Leu • Ser 


Ser 


Ser 

155. 


Glu 


Pro His 


Ala 


Glu Leu Asp 


Gly 


Glu 


Thr Asn 








170 






Pro 


Val Thr 


Ser 


Glu 


Leu 


Gly Asp 








185 




Asp 


Gly Glu Val 


He 


Cys 


Glu Pro 








200 






Ser Gly Thr 


Leu 


Tyr 


Trp , Lys 








215 




Asn 


Gin Asn 


Met 


Leu 


Leu 


Arg Gly 








23 0 




1 Trp 


Cys Phe Gly 


Leu 


Val. 


lie Phe 








245 






Met 


Gin Asn 


Ser 


Gly 


Arg 


Thr Lys 








260 




Arg 


Leu Met 


Asn 


Thr 
275 


Leu 


Val Leu 


cys 


Met Gly Val 


lie 


Leu 


Ala He 








290 






Glu 


val Gly Met 


Arg 


Phe 


Gin Val 








305 






Val 


Asp Ser 


Ala 


Phe 


Phe 


Ser Gly 








320 




lie 


He He 


Leu 


Asn 
335 


Thr 


Val Val 


Val 


Glu Val 


He 


Arg 


Leu 


Gly His 








350 




Lys 


Lys Met 


Phe 


cys 
365 


Met 


Lys Lys 


Thr 


Thr Thr 


Leu 


Asn 
380 


Glu 


Glu Leu 


Ser 


Asp Lys 


Thr 


Gly 
395 


Thr 


Leu Thr 


Lys 


Cys Ser 


He 


Asn 
410 


Gly 


His Ser 


Jieu 


Gly His 


Lys 


Ala 
425 


Glu 


Leu Gly 


Phe 


Ser Phe 


Asn 


Pro 


Leu 


Ala Asp 








440 




Pro 


Ser . Leu 


Leu 


Glu. 


Ala 


Val Lys 








455 




Glu 


Phe Phe Arg. 


Leu 


Leu 


Ser Leu 








470 






Glu 


Lys Asn 


Glu 


Gly 
485 


Glu 


Leu Tyr 


Glu 


Gly Ala Leu 


Val 


Thr 


Ala Ala 








500 






Arg 


Ser Arg Thr 


Pro 


Lys 


Thr He 








515 




Ala 


He Thr 


Tyr 


Glh 
53 0 


Leu 


Leu Ala 


Arg 


Lys Arg Met 


Ser 


Val 


He Val 








545 






Arg 


Leu Ty^: 


Cys 


Lys 
560 


Gly 


Ala Asp 
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70 






75 


Pro 


Leu. 


Val Leu Val 


Leu 


Thr 




85 




90 


Asp 


Asp 


Tyr Phe .Arg His 


Lys 




IQO 






105 


Gin 


Ser Gin Val Leu 


He 


Asn 




115 






120 


Met 


iVsn 


Val Cirs Val 


Gly 


Asp 




130 






135 


Phe 


Val Ala Ala Asp Leu 


Leu 




145 






150 


Gly 


Leu Cys Tyr He 


Glu 


Thr 




160 






165 


Met 


Lys Val Arg Gin Ala 


He 




175 






180 


He 


Ser 


Lys Leu Ala 


Lys 


Phe 




190 






195 


Pro 


Asn Asn. hys Leu Asp 


Lys 




205 






210 


Glu 


Asn Lys Phe Pro 


Leu 


^er 




220 






225 


Cys 


Val Leu Arg Asn Thr 


Glu 




235 






240 


Ala 


Gly Pro Asp Thr Lys 


Leu 




250 






255 


Phe 


Lys 


Arg Thr Ser 


.He 


Asp 




265 






270 


Trp 


He 


Phe Gly Phe Leu 


Val 




280 






285 


Gly 


Asn 


Ala He Trp Glu 


His 




295 


1. 




300 


Tyr 


Leu 


Pro Trp Asp 


Glu 


Ala 




310 




315 


Phe 


Leu 


Ser Phe Trp 


Ser 


Tyr 




325 




330 


Pro 


He 


Ser Leu Tyr Val 


Ser 




340 






345 


Ser 


Tyr 


Phe He Asn 


Trp 


Asp 




355 






360 


Arg 


Thr 


Pro Ala Glu 


Ala 


Arg 




370 






375 


Gly 


Gin 


Val Glu Tyr 


He 


Phe 




385 




390 


Gin 


Asn 


He Met Val 


Phe 


Asn 




400 






405 


Type 


Gly 


Asp V.al Phe 


Asp 


Val 


415 






420 


Glu 


Arg 


Pro Glu Pro 


Val 


Asp 




430 






435 


Lys 


Lys 


Phe Leu Phe 


Trp 


Asp 




445 






450 


He 


Gly 


Asp Pro His 


Thr 


His 




460 






465 


Cys 


His 


Thr Val Met 


Ser 


Glu 


475 






480 


Tyr 


Lys 


Ala Gin Ser 


Pro 


Asp 




490 






495 


Arg 


Asn 


Phe Gly Phe Val 


Phe 




505 






510 


Thr 


Val 


His Glu Met 


Gly 


Thr 


He 


520 




525 


Leu 


A$p Phe Asn Asn 


He 




535 






540 


Arg 


Asn 


Pro Glu Gly 




He 




550 




555 


Thr 


He 


Leu Leu Asp 


Arg 


Leu 




565 




570 
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His 


His 


Ser 


Thr Gin 


Glu 


Leu Leu Asn 


Thr 


Thr 


Met 


Asp 


His 


Leu 








575 






580 








585 


Asn 


Glu 


a?yr Ala Gly 


Glu Gly Leu Arg 


Thr 


Leu 


Val 


Leu 


Ala 


Tyr 








590 






595 










600 


Lys 


Asp 


Leu- Asp Glu 


Glu 


Tyr Tyr 'Glu 


Glu Trp 


Ala 


Glu 


Arg 


Arg 




1 




605 






610 










615 


Leu 


Gin 


Ala 


Ser Leu 


Ala Gin Asp Ser 


Arg Glu 


A$p 


Ajrg 


Leu 


Ala 








620 






625 








630 


Ser 


lie 


Tyr 


Glu Glu 


Val 


Glu Asn Asn 


Met 


Met 


Leu 


Leu 


Gly 


Ala 








635 






640 








645 


Tnr 


Ala 


.He Glu Asp 


Lys 


Leu Gin Gin 


Gly Val 


Pro 


Glu 


Thr 


lie 








650 






655 








• 


660 




Leu. 


Leu 


Thr Leu 


Ala Asn He Lys 


He Trp 


Vai 


Leu 


Thr 


Gly 




Lys 




665 






670 










. 675 


■ ASp 


Gin 


Glu Ohr 


Ala 


Val Asn lie 


Gly Tyr 


Ser 


Cys 


Lys 


Met 








680 






685 






690 


Leu 


Tnr 


Asp Asp Met 


Thr Glu Val Phe 


lie 


Val 


Thr 


Gly 


His 


Thr 








695 






7 00 








7Q5 


ival 


Leu 


Glu Val Arg 


Glu 


Glu Leu Arg 


Lys 


Ala 


Arg 


Glu 


Lys 


Met 








710 






715 






720 


Met 


Asp 


Ser 


Ser Arg 


Ser 


Val Gly Asn 


Gly 


Phe 


Thr 


Tyr 


Gin 


Asp 








725 






730 










735 


Lys 


Leu 


Ser 


Ser Ser 


Lys 


Leu Thr Ser 


Val 


Leu 


Glu 


Ala 


Val 


Ala 








740 






745 










750 


Gly 


Glu 


Tyr 


Ala Leu 


Val 


lie Asn Gly 


His 


Ser 


Leu 


Ala 


His 


Ala 








755 






760 










765 


Leu 


Glu 


Ala Asp Met 


Glu 


Leu Glu Phe 


Leu. 


Glu 


Thr 


Ala 


Cys 


Ala 








770 






775 








780 


Cys 


Lys 


Ala 


Val He 


Cys 


,Cys Arg Val 


Thr 


Pro 


Leu 


Gin 


Lys 


Ala 








785 






790 








795 


Gin 


Val 


Val 


Glu Leu 


Val 


Lys Lys Tyr 


Lys 


Lys 


Ala 


Val 


Thr 


Leu 








800 






805 










810 


Ala 


lie 


Gly Asp Gly 


Ala 


Asn Asp Val 


Ser 


Met 


He 


Lys 


Thr 


Ala 








815 






820 








825 


His 


lie 


Gly Val Gly 


He 


Ser Gly Gin 


Glu 


Gly 


He 


Gin 


Ala 


Val 








830 






835 








840 


Leu 


Ala 


Ser 


Asp Tyr 


Ser 


Phe Ser Gin 


Phe 


Lys 


Phe 


Leu 


Gin 


Arg 








845 






850 








855 


Leu 


Leu 


Leu 


Val His 


Gly 


Arg Trp Ser 


Tyr 


Leu 


Arg 


Met 


Cys 


Lys 








860 






865 










870 


Phe 


Leu 


Cys 


Tyr Phe 


Phe 


Tyr Lys Asn 


Phe 


Ala 


Phe 


Thr 


Met 


Val 








875 






880 










885 


His 


Phe 


Trp 


Phe Gly 


Phe 


Phe Cys Gly 


Phe 


Set- 


Ala 


Gin 


Thr 


Val 






890 




895 










900 


Tyr 


Asp 


Gin Tyr Phe 


ile 


Thr Leu Tyr 


Asn 


He 


Val 


Tyr- 


Thr 


Ser 






905 






910 








915 


Leu 


Pro 


Val 


Leu Ala 


Met 


Gly Val Phe 


Asp 


Gin 


Asp 


Val 


Pro 


Glu 








920 






925 








930 


Gin 


Arg 


Ser 


Met- Glu 


Tyr 


Pro Lys Leu 


Tyr 


Glu 


Pro 


Gly 


Gin 


Leu 








935 






940 








945 


Asn 


Leu 


heu 


Phe Asn 


Lys 


Arg Glu Phe 


Phe 


He 


Cys 


He 


Ala 


Gin 








950 






955 








960 


Gly 


He 


Tyr 


Thr Ser 


Val 


Leu Met Phe 


Phe 


Ile 


Pro 


Tyr 


Gly 


Val 








965 






970 






975 


Phe 


Ala 


Asp 


Ala Thr 


Arg 


Asp Asp Gly 


Thr 


Gin 


Leu 


Ala 


Asp 


Tyx 








980 






985 








990 


Gin 


Ser 


Phe 


Ala Val 


Thr 


Val Ala Thr 


Ser 


Leu 


Val 


He 


Val 


Val 




Val 




995 




1000 








1005 


Ser 


Gin 


He Gly 


Leu 


Asp Thr Gly Tyr 


Trp 


Thr 


Ala 


lie 


Asn 








1010 




1015 






1020 


His 


Phe 


Phe 


He Trp 


Gly 


Ser Leu Ala 


Val 


Tyr 


Phe 


Ala 


lie 


Leu 








1025 




1030 








1035 


Phe 


Ala 


Met 


His Ser 


Asn 


Gly Leu Phe 


Asp 


Met 


Phe 


Pro 


Asn 


Gin 








1040 




1045 








1050 


Phe 


Arg 


Phe 


Val Gly 


Asn 


Ala Gin Asn 


Thr 


Leu 


Ala 


Gin 


Pro 


Thr 








1055 




1060 








1065 


Val 


Trp 


Leu 


Thr He 


Val 


Leu Thr Thr 


Val 


Val 


Cys 


lie 


Met 


Pro 
1070 1075 1080

Val Val Ala Phe Arg Phe Leu Arg Leu Asn Leu Lys Pro Asp Leu

1085 1090 1095

Ser Asp Thr Val Arg Tyr Thr Gln Leu Val Arg Lys Lys Gln Lys

1100 1105 1110

Ala Gln His Arg Cys Met Arg Arg Val Gly Arg Thr Gly Ser Arg

1115 1120 1125

Arg Ser Gly Tyr Ala Phe Ser His Gln Glu Gly Phe Gly Glu Leu

1130 1135 1140

Ile Met Ser Gly Lys Asn Met Arg Leu Ser Ser Leu Ala Leu Ser
1145 1150 1155

Ser Phe Thr Thr Arg Ser Ser Ser Ser Trp Ile Glu Ser Leu Arg

1160 1165 1170

Arg Lys Lys Ser Asp Ser Ala Ser Ser Pro Ser Gly Gly Ala Asp

1175 1180 1185

Lys Pro Leu Lys Gly

1190

<210> 5

<211> 467
<212> PRT
<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7473347CD1

<400> 5



Met 


Val 


Leu 


Ala 


Phe 


Gin 


Leu 


Val 


Ser 


Phe 


Thr 


Tyr 


He 


Trp He 


1 






5 










10 








15 


He 


Leu 


Lys 


Pro 


Asn 


Val 


Cys 


Ala 


Ala 


Ser 


Asn 


He 


Lys Met Thr 








20 










25 








30 


His 


Gin 


Arg 


Cys 


Ser 


Ser 


Ser 


Met 


Lys 


Gin 


Thr 


Cys 


Lys 


Gin Glu 










35 










40 








45 


Thr 


Arg 


Met 


Lys 


Lys 


Asp 


Asp 


Ser 


Thr 


Lys 


Ala 


Arg 


Pro 


Gin Lys 










50 










55 








60 


Tyr 


Glu 


Gin 


Leu 


Leu 


His 


lie 


Glu 


Asp 


Asn 


Asp 


Phe. 


Ala 


Met Arg 










65 










70 








75 


Pro 


Gly 


Phe Gly Gly 


Ser 


pro 


Val 


Pro 


Val 


Gly 


He 


Asp 


Val His 










80 










85 








90 


Val 


Glu 


Ser 


He 


Asp 


Ser 


He 


Ser 


Glu 


Thr 


Asn 


Met 


Asp 


Phe Thr 










95 










100 






105 


Met 


Thr 


Phe 


Tyr 


Leu 


Arg 


His 


Tyr 


Trp 


Lys 


Asp 


Glu 


Arg 


Leu Ser 










110 










115 








120 


Phe 


Pro 


Ser 


Thr 


Ala 


Asn 


Lys 


Ser Met 


Thr 


Phe Asp 


His 


Arg Leu 










125 








13 0 








135 


Thr 


Arg 


Lys 


He Trp Val 


Pro 


Asp 


He 


Phe 


Phe . Val 


His 


Ser Lys 










140 










145 








150 


Arg 


Ser 


Phe 


He 


His 


Asp 


Thr 


Thr 


Met 


Glu 


Asn 


lie 


Met. 


Leu Arg 










155 










160 








165 


Val 


His 


Pro Asp 


Gly Asn 


Val 


Leu 


Leu 


Ser 


Leu Arg 


He 


Thr Val 










170 










175 








180 


Ser 


Ala 


Met 


Gys 


Phe 


Met 


Asp 


Phe 


Ser 


Arg 


Phe Pro 


Leu 


Asp Thr 










185 










190 








195 


Gin 


Asn 


Cys 


Ser 


Leu 


Glu 


Leu 


Glu 


Ser 


Tyr 


Ala Tyr 


Asn Glu Asp 








200 










.205 








210 


Asp 


Leu 


Met 


Leu Tyr 


Trp 


Lys His Gly Asn Lys 


Ser 


Leu 


Asn Thr 










215 










220 








225 


Glu 


Glu 


His 


Met 


Ser 


Leu 


Ser 


Gin 


Phe 


Phe 


He 


Glu 


Asp 


Phe Ser 










230 










235 








240 


Ala 


Ser 


Ser Gly Leu Ala 


Phe 


Tyr 


Ser 


Ser 


Thr Gly 


Trp 


Tyr Asn 










245 










250 








255 


Arg 


Leu 


Phe 


He 


He 


Ser 


Val 


Leu Arg Turg 


His 


Val 


Phe 


Phe Phe 








260 










265 








270 


Val 


Leu 


Pro 


Thr 


Tyr 


Tyr 


Pro 


Ala 


He 


Leu 


Met 


Val 


Met 


Leu Ser 










275 










280 








285 


Trp 


Val 


Ser 


Phe 


Trp 


He 


Asp Arg Arg Ala Val 


Pro 


Ala Arg Val 
290 295 300

Ser Leu Gly Ile Thr Thr Val Leu Thr Met Ser Thr Ile Ile Thr

305 310 315

Ala Val Ser Ala Ser Met Pro Gln Val Ser Tyr Leu Lys Ala Val

320 325 330

Asp Val Tyr Leu Trp Val Ser Ser Leu Phe Val Phe Leu Ser Val

335 340 345

Ile Glu Tyr Ala Ala Val Asn Tyr Leu Thr Thr Val Glu Glu Arg

350 355 360

Lys Gln Phe Lys Lys Thr Gly Lys Ile Ser Arg Met Tyr Asn Ile

365 370 375

Asp Ala Val Gln Ala Met Ala Phe Asp Gly Cys Tyr His Asp Ser

380 385 390

Glu Ile Asp Met Asp Gln Thr Ser Leu Ser Leu Asn Ser Glu Asp

395 400 405

Phe Met Arg Arg Lys Ser Ile Cys Ser Pro Ser Thr Asp Ser Ser

410 415 420

Arg Ile Lys Arg Arg Lys Ser Leu Gly Gly His Val Gly Arg Ile

425 430 435

Ile Leu Glu Asn Asn His Val Ile Asp Thr Tyr Ser Arg Ile Leu
440 445 450

Phe Pro Ile Val Tyr Ile Leu Phe Asn Leu Phe Tyr Trp Gly Val

455 460 465

Tyr Val

<210> 6

<211> 1196

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7474240CD1

<400> 6

Met Pro Val Arg Arg Gly His Val Ala Pro Gln Asn Thr Phe Leu
1 5 10 15

Gly Thr Ile Ile Arg Lys Phe Glu Gly Gln Asn Lys Lys Phe Ile

20 25 30

Ile Ala Asn Ala Arg Val Gln Asn Cys Ala Ile Ile Tyr Cys Asn
35 40 45

Asp Gly Phe Cys Glu Met Thr Gly Phe Ser Arg Pro Asp Val Met
50 55 60

Gln Lys Pro Cys Thr Cys Asp Phe Leu His Gly Pro Glu Thr Lys
65 70 75

Arg His Asp Ile Ala Gln Ile Ala Gln Ala Leu Leu Gly Ser Glu
80 85 90

Glu Arg Lys Val Glu Val Thr Tyr Tyr His Lys Asn Gly Ser Thr
95 100 105

Phe Ile Cys Asn Thr His Ile Ile Pro Val Lys Asn Gln Glu Gly

110 115 120

Val Ala Met Met Phe Ile Ile Asn Phe Glu Tyr Val Thr Asp Asn

125 130 135

Glu Asn Ala Ala Thr Pro Glu Arg Val Asn Pro Ile Leu Pro Ile

140 145 150

Lys Thr Val Asn Arg Lys Phe Phe Gly Phe Lys Phe Pro Gly Leu

155 160 165

Arg Val Leu Thr Tyr Arg Lys Gln Ser Leu Pro Gln Glu Asp Pro

170 175 180

Asp Val Val Val Ile Asp Ser Ser Lys His Ser Asp Asp Ser Val

185 190 195

Ala Met Lys His Phe Lys Ser Pro Thr Lys Glu Ser Cys Ser Pro

200 205 210

Ser Glu Ala Asp Asp Thr Lys Ala Leu Ile Gln Pro Ser Lys Cys

215 220 225

Ser Pro Leu Val Asn Ile Ser Gly Pro Leu Asp His Ser Ser Pro
230 235 240





Axg 


\3±TX 


Trp 


Asp 


Arg 


Leu 


Tyr 


Pro 


Asp 


Met 


Leu 


Gin 


Ser 


Ser 




















o c n 








255 






Leu 


Ser 


HIS 


Ser 


■ 

Arg 


Ser 


Arg 


Glu 


Ser 


Leu .Cys 


Ser 


He 










^ OU 










265 


Gly 








270 




AX 9" 


Axa 


Ser 


oer 


vai 


rllS 


Asp 


'lie 


Glu 


I>he Gly 


Val 


His 




1 






^ / o 










280 










285 


±r -L vj 


JLiys 


Asn 


in A 
lie 


Jrne 


Arg 


Asp 


Arg 


His 


Ala 


Ser 


Glu Asp 


Asn 


Gly 










on 










295 










300 


Ax y 


TV CSV* 


vajL. 


Lys 


v^iy 


Pro 


pne 


Asn 


His lie 


T ^ rev ■ 

Lys 


Ser 


ser 


Leu 


Leu 










305 










310 








315 




Sex 


'(T>Vk V 

inj^ 


Ser 


Asp 


Ser 


Ash 


Leu 


Asn Lys 


Tyr 


Ser 


Thr 


He 


Asn 










•a OA 


. Thr 








325 










330 


Lys 


j.xe 


Pxo 


Gin 


Leu 


Leu 


TV MM 

Asn 


Phe 


Ser 


(^lu 


Val 


Lys 


Thr 


Glu 




















340 






345 


jjy s 


Lys 


Asn 


Ser . 


Ser 
•5 t; n 

O D U 


Pro 


Pro 


Ser 


Ser 


Asp 
355 


Lys 


Thr 


He 


He 


Ala 
3 60 


Pro 


Lys 


vai 


Lys 


Asp 


Arg 


inr 


HIS 


Asn 


Val 


Thr. 


Glu 


Lys 


Val 


Thr 










3 65 










370 








375 


ya 1 


Leii 


Ser 


Leu 


Gly 


Ala 


,Asp 


Val 


Leu 


Pro 


Glu 


Tyr 


Lys. 


Leu 










380 










385 






390 






Pxo 


Arg 


lie 


TV MM 

Asn 


Lys 


Til*. ^ 

Phe 


Thr 


He 


Leu 


His 


Tyr 


Ser 


Pro 










•a o c 










400 








405 


Pile 


Lys 


TV 1 

Ala 


Val 


Trp 
410 


Asp 


T3rp 


Leu 


lie 


Leu 
415 


Leu 


Leu 


Val 


He 


Tyr 
420 


Thar 


Ala 


lie 


Pne 


Thr 
425 


Pro 


Tyr 


Ser 


Ala 


Ala 
430 


Phe 


Leu. 


Leu 


Asn 


Asp 
435 


Arg 


GlU 


Glu 


Gin 


Lys 


Arg 


Arg 


Glu 


Cys 


Gly 


Tyr 


Ser 


Cys 


Ser 


Pro 










A A n 
440 










445 








450 


lieu 


Asn 


Val 


Val 


Asp 
455 


Leu Ile Val Asp Ile Met Phe Ile Ile Asp
460 465

Ile Leu Ile Asn Phe Arg Thr Thr Tyr Val Asn Gln Asn Glu Glu
470 475 480

Val Val Ser Asp Pro Ala Lys Ile Ala Ile His Tyr Phe Lys Gly
485 490 495

Trp Phe Leu Ile Asp Met Val Ala Ala Ile Pro Phe Asp Leu Leu
500 505 510

Ile Phe Gly Ser Gly Ser Asp Glu Thr Thr Thr Leu Ile Gly Leu
515 520 525

Leu Lys Thr Ala Arg Leu Leu Arg Leu Val Arg Val Ala Arg Lys
530 535 540

Leu Asp Arg Tyr Ser Glu Tyr Gly Ala Ala Val Leu Met Leu Leu
545 550 555

Met Cys Ile Phe Ala Leu Ile Ala His Trp Leu Ala Cys Ile Trp
560 565 570

Tyr Ala Ile Gly Asn Val Glu Arg Pro Tyr Leu Thr Asp Lys Ile
575 580 585

Trp Leu Asp Ser Leu Gly Gln Gln Ile Gly Lys Arg Tyr Asn
590 595 600

Asp Ser Asp Ser Ser Ser Gly Pro Ser Ile Lys Asp Lys Tyr Val
605 610 615

Thr Ala Leu Tyr Phe Thr Phe Ser Ser Leu Thr Ser Val Gly Phe
620 625 630

Gly Asn Val Ser Pro Asn Thr Asn Ser Glu Lys Ile Phe Ser Ile
635 640 645

Cys Val Met Leu Ile Gly Ser Leu Met Tyr Ala Ser Ile Phe Gly
650 655 660

Asn Val Ser Ala Ile Ile Gln Arg Leu Tyr Ser Gly Thr Ala Arg
665 670 675

Tyr His Met Gln Met Leu Arg Val Lys Glu Phe Ile Arg Phe His
680 685 690

Gln Ile Pro Asn Pro Leu Arg Gln Arg Leu Glu Glu Tyr Phe Gln
695 700 705

His Ala Trp Thr Tyr Thr Asn Gly Ile Asp Met Asn Met Val Leu
710 715 720

Lys Gly Phe Pro Glu Cys Leu Gln Ala Asp Ile Cys Leu His Leu
725 730 735
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Asn 


Gin 


Thr 


Leu 


Leu 


vvxn Asn 










740 




Dys 


Gly cys 


Leu 


Arg 


Ala Leu 










.755 




AXck. 


Pro 


Pro- 


Gly Asp 


xnr Lieu 










•77 rt 

/ / u 




AX a 


L6XI 


Tyr 






Ser Arg 










/ OO 




Asp 


He 


Val 


VAX 




lie Leu 










ft n n 






Val 


His 




xyr 


AX a ijys 










oXo 


At y 


Ala 


Leu 


Xtir 


Tyr 


TO Hi. av\ 

v-^^ys Asp 










o J U 






Leu 


Glu 


vax 


Leu 


risp neu 














Thr Asn 


• 

Leu 


I7XU 


jjeu xjjir 










qOU 


,ijys 


Ala Asp 


Leu 


Leu 


Arg b er 










87 b 




Asp 


Asn 


Cys 


Lys 


. Leu 


Arg Arg 










0 1\ f\ 
890 






Glu 


Lys 


GXU 


Ash 


Ser Thr 










905 




TniT 


He 


Arg 


His 


Tyr 


Gin Ser 










920 




Lys 


Ser Arg 


Ser 


Ser 


Ser Phe 










935 




Lys 


Pro 


Leu 


Phe 


Ser 


Gxy xxe 










950 




T n m 

Liys 


Ala 


Ser 


Gly 


Leu 


Asp Phe 










965 




Arg 


Met 


His 


lie 


Asp 


Lys Arg 










980 




Asp 


Met 


Arg 


Ser 


Tip 


Glu Arg 










995 




Asp 


Ser 


Ser 


Pro 


Ser 


Ala Leu 










1010 






Thr 


Glu 


Ser 


Asp 


Leu Thr 










1025 




Asp 


Leu 


Leu 


Gxn 


Glu 


Gxn Leu 










1040 






Asp. 


He 


Gin 


Thr 


He 'Leu 


t 








1055 




vax 


Val 


Pro 


Pro 


Ala 


- 

Tyr Ser 










1070 


■ 




Arg 


Pro 


He 


He 


Gin Leu 










1085 




-Sex* 


He Lys 


Thr 


Asp 


Arg Ser 










1100 




v»Xu. 


Phe Leu 


Asp 


Leu 


Glu Lys 










1115 






Ser 


Ser 


Gly Val 


rlxS Leu 










1130 






Ser 


Leu 


Leu 


Lys 


LyXn ASp 










1145 


ij6U 


Arg Gin 


Arg 


Lys 


mr xyr 










1160 




Leu 


Pro 


Asp 


Ser 


Ser 


Leu Ser 










1175 




Arg 


His 


Val 


Ser Asp 


Pro Gly 










1190 





<210> 7 
<211> 512 
<212> PRT 

<213> Homo sapiens 



Cys Lys Ala 


Phe 


Arg Gly 


Ala Ser 


745 


Phe 








Ala Met Lys 


Lys 


Thr 


Thr His 


• 760 








7o5 


Val. His cys Gly 


Asp 


V^l 


Leu Thr 


lis 








/ oO 


Gly Ser He Glu 


T?.e 


Leu 


Lys Asp 


790 








■7 Q C 


Gly Lys Asn Asp 


He 


Phe 


Gxy Gxu 


805 








810 


Pro .Gly Lys 


Ser 


Asn 


Ala 


Asp Val 


' 820 










Leu His Lys 


He 


Gin 


Arg 


Glu Asp 


835 


1 




1 


o40 


Tyr Pro Glu 


Phe 


Ser 


Asp 


His Phe 


850 






0 c c 


Phe Asn Leu 


Arg 


His 


Glu 


Ser Ala 


865 




Asp 




0 / O 


Gin Ser Met 


Asn. 


Ser 


Glu Gly 


880 








885 


Arg Lys Leu 


Ser 


Phe. 


Glu 


Ser Glu 


895 








900 


Asn Asp Pro 


Glu 


Asp 


Ser 


Ala Asp 


910 








915 


Ser Lys Arg 


His 


Phe 


Glu 


Glu Lys 


925 








930. 


He Ser Ser, He 


Asp 


Asp 


Glu Gin 


940 








945 


Val Asp Ser 


Ser 


Pro 


Gly 


He Gly 


955 








960 


Glu Glu Thr 


Val 


Pro 


Thr 


Ser Gly 


970 








975 


Ser His Ser 


Cys 


Lys 


Asp 


He Thr 


985 








990 


Glu Asn Ala 


His 


Pro 


iSln 


Pro Glu 


1000 








1005 


Gin Arg Ala Ala 


Trp 


Gly 


He Ser 


1015. 








1020 


Tyr Gly Glu Val 


Glu 


Gin 


Arg Leu 


1030 








1035 


Asn Arg Leu 


Glu 


Ser 


Gin 


Met Thr 


1045 








1050 


Gin Leu Leu 


Gin 


Lys 


Qln 


Thr Thr 


1060 






1065 


Met Val Thr 


Ala 


Gly 


Ser- 


Glu Tyr 


. 1075 








1 080 


Met Arg Thr 


Ser 


.din 


Pro 


Glu Ala 


1090 








1095 


Phe Ser Pro 


Ser 


Ser 


Gin 


Cys Pro 


1105 








1110 


Ser Lys Leu 


Lys 


Ser 


Lys 


Glu Ser 


1120 








1125 


Asn Thr Ala 


Ser 


Glu 


Asp 


Asn Leu 


1135 




■ 


1140 


Ser Asp Leu 


Ser 


Leu 


V^XU 


Leu His 


1150 








X X33 


Val His Pro 


He 


Arg 


His 


Pro Ser 


1165 






1170 


Thr Val Gly 


He 


Val 


Gly 


Leu His 


1180 








1185 


Leu Pro Gly 


Lys 









H95 
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<220> . 

<221> misc_feature

<223> Incyte ID No: 747^338CD1

<400> 7

Met Glu Asn Lys Glu Ala Gly Thr Pro Pro Pro Ile Pro Ser Arg
1 5 10 15

Glu Gly Arg Leu Gln Pro Thr Leu Leu Leu Ala Thr Leu Ser Ala
20 25 30

Ala Phe Gly Ser Ala Phe Gln Tyr Gly Tyr Asn Leu Ser Val Val
35 40 45

Asn Thr Pro His Lys Val Phe Lys Ser Phe Tyr Asn Glu Thr Tyr
50 55 60

Phe Glu Arg His Ala Thr Phe Met Asp Gly Lys Leu Met Leu Leu
65 70 75

Leu Trp Ser Cys Thr Val Ser Met Phe Pro Leu Gly Gly Leu Leu
80 85 90

Gly Ser Leu Leu Val Gly Leu Leu Val Asp Ser Cys Gly Arg Lys
95 100 105

Gly Thr Leu Leu Ile Asn Asn Ile Phe Ala Ile Ile Pro Ala Ile
110 115 120

Leu Met Gly Val Ser Lys Val Ala Lys Ala Phe Glu Leu Ile Val
125 130 135

Phe Ser Arg Val Val Leu Gly Val Cys Ala Gly Ile Ser Ser
140 145 150

Ala Leu Pro Met Tyr Leu Gly Glu Leu Ala Pro Lys Asn Leu Arg
155 160 165

Gly Met Val Gly Thr Met Thr Glu Val Phe Val Ile Val Gly Val
170 175 180

Phe Leu Ala Gln Ile Phe Ser Leu Gln Ala Ile Leu Gly Asn Pro
185 190 195

Ala Gly Trp Pro Val Leu Leu Ala Leu Thr Gly Val Pro Ala Leu
200 205 210

Leu Gln Leu Leu Thr Leu Pro Phe Phe Pro Glu Ser Pro Arg Tyr
215 220 225

Ser Leu Ile Gln Lys Gly Asp Glu Ala Thr Ala Arg Gln Ala Leu
230 235 240

Arg Arg Leu Arg Gly His Thr Asp Met Glu Ala Glu Leu Glu Asp
245 250 255

Met Arg Ala Glu Ala Arg Ala Glu Arg Ala Glu Gly His Leu Ser
260 265 270

Val Leu His Leu Cys Ala Leu Arg Ser Leu Arg Trp Gln Leu Leu
275 280 285

Ser Ile Ile Val Leu Met Ala Gly Gln Gln Leu Ser Gly Ile Asn
290 295 300

Ala Ile Asn Tyr Tyr Ala Asp Thr Ile Tyr Thr Ser Ala Gly Val
305 310 315

Glu Ala Ala His Ser Gln Tyr Val Thr Val Gly Ser Gly Val Val
320 325 330

Asn Ile Val Met Thr Ile Thr Ser Ala Val Leu Val Glu Arg Leu
335 340 345

Gly Arg Arg His Leu Leu Leu Ala Gly Tyr Gly Ile Cys Gly Ser
350 355 360

Ala Cys Leu Val Leu Thr Val Val Leu Leu Phe Gln Asn Arg Val
365 370 375

Pro Glu Leu Ser Tyr Leu Gly Ile Ile Cys Val Phe Ala Tyr Ile
380 385 390

Ala Gly His Ser Ile Gly Pro Ser Pro Val Pro Ser Val Val Arg
395 400 405

Thr Glu Ile Phe Leu Gln Ser Ser Arg Arg Ala Ala Phe Met Val
410 415 420

Asp Gly Ala Val His Trp Leu Thr Asn Phe Ile Ile Gly Phe Leu
425 430 435

Phe Pro Ser Ile Gln Glu Ala Ile Gly Ala Tyr Ser Phe Ile Ile
440 445 450

Phe Ala Gly Ile Cys Leu Leu Thr Ala Ile Tyr Ile Tyr Val Val
455 460 465

Ile Pro Glu Thr Lys Gly Lys Thr Phe Val Glu Ile Asn Arg Ile
470 475 480

Phe Ala Lys Arg Asn Arg Val Lys Leu Pro Glu Glu Lys Glu Glu
485 490 495

Thr Ile Asp Ala Gly Pro Pro Thr Ala Ser Pro Ala Lys Glu Thr
500 505 510

Ser Phe




















<210> 8 




















<211> 568 
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<213> Homo sapiens 
















<220> 




















<221> misc_feature 
















<223> Incyte ID No: 7476747CP1
































Met Thr Ala Ser Thr Pro Glu Ala Thr Pro Asn Met Glu Leu Lys
1 5 10 15

Ala Pro Ala Ala Gly Gly Leu Asn Ala Gly Pro Val Pro Pro Ala
20 25 30

Ala Met Ser Thr Gln Arg Leu Arg Asn Glu Asp Tyr His Asp Tyr
35 40 45

Ser Ser Thr Asp Val Ser Pro Glu Glu Ser Pro 3&t Glu Gly Leu
50 55 60

Asn Asn Leu Ser Ser Pro Gly Ser Tyr Gln Arg Phe Gly Gln Ser
65 70 75

Asn Ser Thr Thr Trp Phe Gln Thr Leu Ile His Leu Leu Lys Gly
80 85 90

Asn Ile Gly Thr Gly Leu Leu Gly Leu Pro Leu Ala Val Lys Asn
95 100 105

Ala Gly Ile Val Met Gly Pro Ile Ser Leu Leu Ile Ile Gly Ile
110 115 120

Val Ala Val His Cys Leu Gly Ile Leu Val Lys Cys Ala His His
125 130 135

xrXis \-ys AX:^ Arg Leu Asn Lys Ser Phe Val Asp Tyr Gly Asp Thr
140 145 150

Val Met Tyr Gly Leu Glu Ser Ser Pro Cys Ser Trp Leu Arg Asn
155 160 165

His Ala His Trp Gly Gly Arg Val Val Asp Phe Phe Leu Ile Val
170 175 180

Thr Gln Leu Gly Phe Cys Cys Val Tyr Phe Val Phe Leu Ala Asp
185 190 195

Asn Phe Lys Gln Val Ile Glu Ala Ala Asn Gly Thr Thr Asn Asn
200 205 210

Cys His Asn Asn Glu Thr Val Ile Leu Thr Pro Thr Met Asp Ser
215 220 225

Arg Leu Tyr Met Leu Ser Phe Leu Pro Phe Leu Val Leu Leu Val
230 235 240

Phe Ile Arg Asn Leu Arg Ala Leu Ser Ile Phe Ser Leu Leu Ala
245 250 255

Asn Ile Thr Met Leu Val Ser Leu Val Met Ile Tyr Gln Phe Ile
260 265 270

Val Gln Arg Ile Pro Asp Pro Ser His Leu Pro Leu Val Ala Pro
275 280 285

Trp Lys Thr Tyr Pro Leu Phe Phe Gly Thr Ala Ile Phe Ser Phe
290 295 300

Glu Gly Ile Gly Met Val Leu Pro Leu Glu Asn Lys Met Lys Asp
305 310 315

Pro Arg Lys Phe Pro Leu Ile Leu Tyr Leu Gly Met Val Ile Val
320 325 330

Thr Ile Leu Tyr Ile Ser Leu Gly Cys Leu Gly Tyr Leu Gln Phe
335 340 345

Gly Ala Asn Ile Gln Gly Ser Ile Thr Leu Asn Leu Pro Asn Cys
350 355 360

Trp Leu Tyr Gln Ser Val Lys Leu Leu Tyr Ser Ile Gly Ile Phe
365 370 375

Phe Thr Tyr Ala Leu Gln Phe Tyr Val Pro Ala Glu Ile Ile Ile
380 385 390

Pro Phe Phe Val Ser Ala Pro Glu Pro Cys Glu Leu Val Val
395 400 405

Asp Leu Phe Val Arg Pro Val Leu Val Cys Leu Thr Ser Leu Ser
410 415 420

Gly Ser Val Asp Asn Gly Trp Tyr Gly Thr Glu Ala Asp Gly Thr
425 430 435

Ser Cys Gly Ser Ala Pro Leu Val Phe Val Ser Ser Ser Phe Leu
440 445 450

Ala His Pro Trp Leu Ser Phe Arg Cys Glu Ser Gln Trp Val Ser
455 460 465

Cys His Arg Asp Thr Val Val Val Trp Gly Phe Ala Arg Gly Ile
470 475 480

Leu Ala Ile Leu Ile Pro Arg Leu Asp Leu Val Ile Ser Leu Val
485 490 495

Gly Ser Val Ser Ser Ser Ala Leu Ala Leu Ile Ile Pro Pro Leu
500 505 510

Leu Glu Val Thr Thr Phe Tyr Ser Glu Gly Met Ser Pro Leu Thr
515 520 525

Ile Phe Lys Asp Ala Leu Ile Ser Ile Leu Gly Phe Val Gly Phe
530 535 540

Val Val Gly Thr Tyr Glu Ala Leu Tyr G\\x Leu Ile Gln Pro Ser
545 550 555

Asn Ala Pro Ile Phe Ile Asn Ser Thr Cys Ala Phe Ile
560 565

<210> 9

<211> 958

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 7477898CD1

<400> 9 

Met Pro Val Arg Arg Gly His Val Ala Pro Gln Asn Thr Tyr Leu
1 5 10 15

Asp Thr Ile Ile Arg Lys Phe Glu Gly Gln Ser Arg Lys Phe Leu
20 25 30

Ile Ala Asn Ala Gln Met Glu Asn Cys Ala Ile Ile Tyr Cys Asn
35 40 45

Asp Gly Phe Cys Glu Leu Phe Gly Tyr Ser Arg Val Glu Val Met
50 55 60

Gln Gln Pro Cys Thr Cys Asp Phe Leu Thr Gly Pro Asn Thr Pro
65 70 75

Ser Ser Ala Val Ser Arg Leu Ala Gln Ala Leu Leu Gly Ala Glu
80 85 90

Glu Cys Lys Val Asp Ile Leu Tyr Tyr Arg Lys Asp Ala Ser Ser
95 100 105

Phe Arg Cys Leu Val Asp Val Val Pro Val Lys Asn Glu Asp Gly
110 115 120

Ala Val Ile Met Phe Ile Leu Asn Phe Glu Asp Leu Ala Gln Leu
125 130 135

Leu Ala Lys Cys Ser Ser Arg Ser Leu Ser Gln Arg Leu Leu Ser
140 145 150

Gln Ser Phe Leu Gly Ser Glu Gly Ser His Gly Arg Pro Gly Gly
155 160 165

Pro Gly Pro Gly Thr Gly Arg Gly Lys Tyr Arg Thr Ile Ser Gln
170 175 180

Ile Pro Gln Phe Thr Leu Asn Phe Val Glu Phe Asn Leu Glu Lys
185 190 195

His Arg Ser Ser Ser Thr Thr Glu Ile Glu Ile Ile Ala Pro His
200 205 210
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Ujf » 


Val 

V A J. 


Val Glu 

VAX V3XUI 


Arg 


Thr Gin Asn 


Val 

VAX 


Thr 


Glu 


Lys 


VAX 


xnr 


r*l Tl 
vsxn 






215 






220 










Val 






Glyi tAla Asp^* Val 


Leu 


Pro *Glu 


lyr 


Lys 


Leu 


\sXTl 








230 






235 






9 An. 


Ala 


Pro 


Ar ^ X 1 e 


His 
245 


Attt Turn TFit* 

<^JL^ JLXaX 


He 


Leu 
250 


nxo 


ixyr 




Pro ' 




Ijys 


Ala 


Vail Tna 


Asp 


Tnr> Li^ti Tl ^ 

XX^I XJdwl aLXC 


Leu Leu 


TiOii 
Xj"vI 


Val • 

VAX 


Tl a 
X x@ 


lyr 


XXiX 








260 














97n 


Ala 


Val 


Phe Thr 


Pro 


Tyr Ser Ala 


Al a 

X&X A 




Leu 


XJt?U 


06X 


A dr\ 


v^XXi 








275 
















Asp 


Glu 


Ser Arg" 


Arg 


Gly Ala Cys 


OtsX 


iyr 


Thr 


t 


OlvX 


Ir X \j 


Leu 








290 






9 Q ^ 








3 00 


Thr 


Val 


Val Asp 


Leu 


He Val Asp 


He 


Met 


Phe 


Val 

VAX 


Val 

VAX 


A Girt 
Ao^ 


Tl o 
XX6 








305 


1 


♦ 


J X u 


» 






O XD 


Val 


lie 


Ash Phe 


Arg 


Thr Thr Tyr 


Val 

VAX 




Thr 


Asn 


AS ^ 


Gl 11 
wXU 


Val 
VAX 








.320 
















Val 


Ser 


His Pro Arg Arg lie Ala 


vax 


rlxS 


Tyr 


Phe 


Lys 


Gly 


Trp 








335 
















. ^ 


Plie 


Leu. 


lie Asp Met' 


Val Ala Ala 


xxe 


Pro 


Phe Asp 


Leu 


Leu 


Tl <=i 

X xe 








350 






355 










360 


Pbe 




Thr Gly 


Ser 


Asp Glu Thr 


Thr, 


Thr 


Leu 


He 


Gly Leu 


Leii 








365 






370 












Lys 




Ala Arg 


Leu 


Leu Arg Leu 


Val 


Arg 


V^l 


Ala 


Arg 


Lys 
1 


Leu 








380 






385 








Ton 




AT*rf 

AX ^ 


Tyr Ser 


Glu 
.395 


Tyr Gly Ala 


Ala 


val 

400 


Leu 


Phe 


Leu 


Leu 


Met. 

^ UD 






Phe Pro 


Leu 


He I^ld Ms 


Trp 


Leu 


Ala 


Cys 


He 


Trp 


Tyr 








410 






415 








420 . 




X X 


Gly Asn Val 


Glu Arg Pro 


Tyr 


Leu 


Glu 


His 


Lys 


He 


T ■» r 

vjiy 








425 






430 










4 JO 






Asp Ser 


Leu 


Gly Val Gin 


Lfeu 


Gly Lys 


Arg 


Tyr 


Asn 


^-Ly 








440 






445 










450 




Asj> 


Pro Ala 


Ser 
455 


Gly Pro Ser 


Val 


Gin 
460 


Asp 


Lys 


Tyr 


Val 


rnr 


ri J.C3L 


Xjeu 


Tyr Phe 


Thr 


Phe Ser Ser 


Leu 


Thr 


Ser 


Val 


Gly 


Phe 


Gly 








470 






475 












VAX 


Ser Pro 


Asn 
485. 


Thr Asn Ser 


Glu 


Lys 
490 


Val 


Phe 


Ser 


He 


Cys 






Leu lie 


Gly 


Ser Leu Met 


Tyr 


Ala 


Ser 


lie 


Phe Gly 


Asn 








500 






505 












VAX 


otsx 


Ala He 


He 


Gin Arg Leu 


Tyr 


Ser 


Gly Thr 


Ala Arg 


Tyr 








515 






520 










CIO 


Hie; 


xiix 


Gin Met 


Leu 


Arg Val Lys 


GxU 


Phe 


He Arg 


Phe 


His 


i0xn 








530 






535 










04t U 




fX V 


Asn Pro 


Leu Arg Gin Arg 


Leu 


Glu 


Glu Tyr 


Phe 


Gin 


tiXS 








545 






550 










DO D 


AT A 


xxp 


Ser Tyr 


Thr 


Asn Gly He 


Asp 


Met 


Asn Ala 


Val 


Leu 


Lys 








560 






565 










O f u 






Pro Glu 


Cys 


Leu Gin Ala 


Asp 


He 


Cys 


Leu 


His 


Leu 


nxS 








575 




580 










D o 3 




Ala 


Leu Leu 


Gin 


His Cys Pro 


Ala 
AX A 


Phe 


Ser 


Gly 


Ala Gly 


Lys 








590 






595 












Gly 




Leu Arg Ala 


Leu Ala Val 


Lys 


Phe 


Lys 


Thr 


Thr 


His 


Ala 
aXa 








605 






610. 








O X3 


Pro 




Gly Asp Thr 


Leu Val His 


Leu 


Gly Asp Val 


Leu 


Ser 


Thr 








620 






625 










D J \J 


Leu 


•■•jr *■ 


Phe He 


Ser 


Arg Gly Ser 


He 


Glu 


He 


Leu 


Arg Asp 


aS_£J 








635 






640 












Val Val Val Ala Ile Leu Gly Lys Asn Asp Ile Phe Gly Glu Pro
650 655 660

Val Ser Leu His Ala Gln Pro Gly Lys Ser Ser Ala Asp Val TV •yf^
665 670 675

Ala Leu Thr Tyr Cys Asp Leu His Lys Ile Gln Arg Ala Asp Leu
680 685 690

Leu Glu Val Leu Asp Met Tyr Pro Ala Phe Ala Glu Ser Phe Trp
695 700 705

Ser Lys Leu Glu Val Thr Phe Asn Leu Arg Asp Val Thr Gly Gly
710 715 720

Leu His Ser Ser Pro Arg Gln Ala Pro Gly Ser Gln Asp His Gln
725 730 735

Gly Phe Phe Leu Ser Asp Asn Gln Ser Asp Ala Ala Pro Pro Leu
740 745 750

Ser Ile Ser Asp Ala Phe Trp Leu Trp Pro Glu Leu Leu Gln Glu
755 760 765

Met Pro Pro Lys His Ser Pro Gln Ser Pro Gln Glu Asp Pro Asp
770 775 780

Cys Trp Pro Leu Lys Leu Gly Ser Arg Leu Glu Gln Leu Gln Ala
785 790 795

Gln Met Asn Arg Leu Glu Ser Arg Val Ser Ser Asp Leu Ser Arg
800 805 810

Ile Leu Gln Leu Leu Gln Lys Pro Met Pro Gln Gly His Ala Ser
815 820 825

Tyr Ile Leu Glu Ala Pro Ala Ser Asn Asp Leu Ala Leu Val Pro
830 835 840

Ile Ala Ser Glu Thr Thr Ser Pro Gly Pro Arg Leu Pro Gln Gly
845 850 855

Phe Leu Pro Pro Ala Gln Thr Pro Ser Tyr Gly Asp Leu Asp Asp
860 865 870

Cys Ser Pro Lys His Arg Asn Ser Ser Pro Arg Met Pro His Leu
875 880 885

Ala Val Ala Met Asp Lys Thr Leu Ala Pro Ser Ser Glu Gln Glu
890 895 900

Gln Pro Glu Gly Leu Trp Pro Pro Leu Ala Ser Pro Leu His Pro
905 910 915

Leu Glu Val Gln Gly Leu Ile Cys Gly Pro Cys Phe Ser Ser Leu
920 925 930

Pro Glu His Leu Gly Ser Val Pro Lys Gln Leu Asp Phe Gln Arg
935 940 945

His Gly Ser Asp Pro Gly Phe Ala Gly Ser Trp Gly His
950 955







<210> 10 

<211> 724

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7472728CD1 

<400> 10 



Met Gly His Gln Gly Pro Phe Glu Glu Gly Asn Gly Gly Leu Arg
1 5 10 15

Val Ile Ala Thr Trp Arg Arg Lys Glu Ala Trp Arg Arg Asp Cys
20 25 30

Leu Leu Gly Ala Leu Pro Ser Val Ser Cys Gly Gly Trp Gly His
35 40 45

Arg Gly Arg Gln Thr Tyr Gly Arg Ala Cys Gly Val Lys Glu Lys
50 55 60

Pro Phe Ser Leu Leu Gly Pro Gln Ile Thr Val Tyr Ala Val Trp
65 70 75

Pro Gln Ser Glu Gly Pro Gln Glu Gly Arg Leu Arg Val Asn Ser
80 85 90

Ala Cys Leu Pro Pro Glu Arg Gly Leu Thr Asn Ala Cys Thr Asn
95 100 105

His Glu Glu Leu Ser Leu Asp Cys Leu Leu Phe Glu Asn Val Asn
110 115 120

Thr Leu Thr Leu Asp Phe Cys Leu Trp Glu Lys Thr Thr Ile Val
125 130 135

Pro Gly Val Leu Pro Tyr Ala Gly Leu Thr Leu Gln Ser Lys Phe
140 145 150

Leu Leu Gly Arg Ala Leu Leu Ala Gly Val His Val Ile Thr Leu
155 160 165

Thr Pro Glu Arg Val Thr His His Val His Gly Trp Tyr Met Glu
170 175 180

Asp Gly Phe Lys Gly Asp Arg Thr Glu Gly Cys Arg Ser Asp Ser
185 190 195

Val Ala Val Pro Ala Ala Ala Pro Val Cys Gln Pro Lys Ser Ala
200 205 210

Thr Asp Gly Gln Pro Pro Ala Pro Ala Pro Thr Pro Thr Pro Arg
215 220 225

Leu Ser Ile Ser Ser Arg Ala Thr Val Val Ala Arg Met Glu Gly
230 235 240

Thr Ser Gln Gly Gly Leu Gln Thr Val Met Lys Trp Lys Thr Val
245 250 255

Val Ala Ile Phe Val Val Val Val Val Tyr Leu Val Thr Gly Gly
260 265 270

Leu Val Phe Arg Ala Leu Glu Gln Pro Phe Glu Ser Ser Gln Lys
275 280 285

Asn Thr Ile Ala Leu Glu Lys Ala Glu Phe Leu Arg Asp His Val
290 295 300

Cys Val Ser Pro Gln Glu Leu Glu Thr Leu Ile Gln His Ala Leu
305 310 315

Asp Ala Asp Asn Ala Gly Val Ser Pro Ile Gly Asn Ser Ser Asn
320 325 330

Asn Ser Ser His Trp Asp Leu Gly Ser Ala Phe Phe Phe Ala Gly
335 340 345

Thr Val Ile Thr Thr Met Tyr Gly Asn Ile Ala Pro Ser Thr Glu
350 355 360

Gly Gly Lys Ile Phe Cys Ile Leu Tyr Ala Ile Phe Gly Ile Pro
365 370 375

Leu Phe Gly Phe Leu Leu Ala Gly Ile Gly Asp Gln Leu Gly Thr
380 385 390

Ile Phe Gly Lys Ser Ile Ala Arg Val Glu Lys Val Phe Arg Lys
395 400 405

Lys Gln Val Ser Gln Thr Lys Ile Arg Val Ile Ser Thr Ile Leu
410 415 420

Phe Ile Leu Ala Gly Cys Ile Val Phe Val Thr Ile Pro Ala Val
425 430 435

Ile Phe Lys Tyr Ile Glu Gly Trp Thr Ala Leu Glu Ser Ile Tyr
440 445 450

Phe Val Val Val Thr Leu Thr Thr Val Gly Phe Gly Asp Phe Val
455 460 465

Ala Val Val Val Phe Arg Gly Asn Ala Gly Ile Asn Tyr Arg Glu
470 475 480

Trp Tyr Lys Pro Leu Val Trp Phe Trp Ile Leu Val Gly Leu Ala
485 490 495

Tyr Phe Ala Ala Val Leu Ser Met Ile Gly Asp Trp Leu Arg Val
500 505 510

Leu Ser Lys Lys Thr Lys Glu Glu Val Gly Glu Ile Lys Ala His
515 520 525

Ala Ala Glu Trp Lys Ala Asn Val Thr Ala Glu Phe Arg Glu Thr
530 535 540

Arg Arg Arg Leu Ser Val Glu Ile His Asp Lys Leu Gln Arg Ala
545 550 555

Ala Thr Ile Arg Ser Met Glu Arg Arg Arg Leu Gly Leu Asp Gln
560 565 570

Arg Ala His Ser Leu Asp Met Leu Ser Pro Glu Lys Arg Ser Val
575 580 585

Phe Ala Ala Leu Asp Thr Gly Arg Phe Lys Ala Ser Ser Gln Glu
590 595 600

Ser Ile Asn Asn Arg Pro Asn Asn Leu Arg Leu Lys Gly Pro Glu
605 610 615

Gln Leu Asn Lys His Gly Gln Gly Ala Ser Glu Asp Asn Ile Ile
620 625 630

Asn Lys Phe Gly Ser Thr Ser Arg Leu Thr Lys Arg Lys Asn Lys
635 640 645

Asp Leu Lys Lys Thr Leu Pro Glu Asp Val Gln Lys Ile Tyr Lys
650 655 660

Thr Phe Arg Asn Tyr Ser Leu Asp Glu Glu Lys Lys Glu Glu Glu
665 670 675

Thr Glu Lys Met Cys Asn Ser Asp Asn Ser Ser Thr Ala Met
680 685

Thr Asp Cys Ile Gln Gln His Ala Glu Leu Glu Asn Gly Met
695 700

Pro Thr Asp Thr Lys Asp Arg Glu Pro Glu Asn Asn Ser Leu
710 715

Glu Asp Arg Asn

• » » 

<210> 11 
<211> 470 

<212> PRT

<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7474322CD1

<220>

<221> unsure

<222> 253

<223> unknown or other

<400> 11



Met 


Tvr 


Asn 


Glu 


He 


Leu 


Met Leu 


Gly Ala Lys Leu His 


Pro 


Thr 


1 








5 




Thr* Asn 




10 




15 


Leu 


Lys 


Leu 


Glu 


Glu 


Leu 


Lys 


Lys Gly Met Thr 


Pro 


Leu 










20 








25 ' 




30 


Ala Leu Ala Ala Gly Thr Gly Lys Ile Gly Asn Arg His Asp Met
35 40 45

Leu Leu Val Glu Pro Leu Asn Arg Leu Leu Gln Asp Lys Trp Asp
50 55 60

Arg Phe Val Lys Arg Ile Phe Tyr Phe Asn Phe Leu Val Tyr Cys
65 70 75

Leu Tyr Met Ile Ile Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val
80 85 90

Asp Gly Leu Pro Pro Phe Lys Met Glu Lys Thr Gly Asp Tyr Phe
95 100 105

Arg Val Thr Gly Glu Ile Leu Ser Val Leu Gly Gly Val Tyr Phe
110 115 120

Phe Phe Arg Gly Ile Gln Tyr Phe Leu Gln Arg Arg Pro Ser Met
125 130 135

Lys Thr Leu Phe Val Asp Ser Tyr Ser Glu Met Leu Leu Phe Leu
140 145 150

Gln Ser Leu Phe Met Leu Ala Thr Val Val Leu Tyr Phe Ser His
155 160 165

Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser Leu Ala Leu Gly
170 175 180

Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gln Gln Met Gly
185 190 195

Ile Tyr Ala Val Met Ile Glu Lys Met Ile Leu Arg Asp Leu Cys
200 205 210

Arg Phe Met Phe Val Tyr Ile Val Phe Leu Phe Gly Phe Ser Thr
215 220 225

Ala Val Val Thr Leu Ile Glu Asp Gly Lys Asn Asp Ser Leu Pro
230 235 240

Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Xaa Arg Pro
245 250 255

Asn Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe
260 265 270

Lys Phe Thr Ile Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr
275 280 285

Asp Phe Lys Ala Val Phe Ile Ile Leu Leu Leu Ala Tyr Val Ile
290 295 300

Leu Thr Tyr Ile Val Leu Leu Leu Asn Met Leu Ile Ala Leu Met
305 310 315

Gly Glu Thr Val Glu Asn Val Ser Lys Glu Ser Glu Arg Ile Trp






Leu 
690 
He • 
705' 

720 














320 








■J ^ ^ 


J s3 U 






V9 JLZl 






X J. C 


Thr 


lie 


jjeu Asp xnr v^xu liys oer 


Jrne 










335 1 






J ** w 














Lys 


AT a 




iixy dcsx vsxy. LjyS ijeu jjeLi 












350 


1 


1 




J -J -3 ♦ 


^ v7 IJ{ 


V CIA 


Gly 


'Pvnr 




f X w 


As^ 






Aqy*i ■ A THrT" Avrf T'^k't^ f^jta 
^ ^£»JL» xyx /\x ^ J.x^ jf o 


r^xl6 










3 65 








J / w . 


"^ 7 R 


Va.1 


Asp 


Glu 


V dJL 


As in 




X XIX 
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L7XU vjXjr Vax Xjyo /ixy xxix 
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rx Vj> \J 


^ u o 


SeiT 






Leix 




CJ 

fcP JL 
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Arg. 


vcix oc;x. vsx^y <^x^ nxo xx^ 


Lys 










410 








X3 ' 






It Alts 


Ala 


Xiexi 


Vai 




Lau 


Leu 


AX^ V7XLI aXcI O^X- AXd aX ^ 


Asp 










425 








430 


435 


Arg Gln Ser Ala Gln Pro Glu Glu Val Tyr Leu Arg Gln Phe Ser
440 445 450

Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser Pro Ala
455 460 465

Ala Ser Gly Glu Lys
470
Met Glu Val Lys Asn Phe Ala Val Trp Asp Tyr Val Val Phe Ala
1 5 10 15


Ala Leu Phe Phe Ile Ser Ser Gly Ile Gly Val Phe Phe Ala Ile
20 25 30

Lys Glu Arg Lys Lys Ala Thr Ser Arg Glu Phe Leu Val Gly Gly
35 40 45

Arg Gln Met Ser Phe Gly Pro Val Gly Leu Ser Leu Thr Ala Ser
50 55 60

Phe Met Ser Ala Val Thr Val Leu Gly Thr Pro Ser Glu Val Tyr
65 70 75

Arg Phe Gly Ala Ser Phe Leu Val Phe Phe Ile Ala Tyr Leu Phe
80 85 90

Val Ile Leu Leu Thr Ser Glu Leu Phe Leu Pro Val Phe Tyr Arg
95 100 105

Ser Gly Ile Thr Ser Thr Tyr Glu Tyr Leu Gln Leu Arg Phe Asn
110 115 120

Lys Pro Val Arg Tyr Ala Ala Thr Val Ile Tyr Ile Val Gln Thr
125 130 135

Ile Leu Tyr Thr Gly Val Val Val Tyr Ala Pro Ala Leu Ala Leu
140 145 150

Asn Gln Val Thr Gly Phe Asp Leu Trp Gly Ser Val Phe Ala Thr
155 160 165

Gly Ile Val Cys Thr Phe Tyr Cys Thr Leu Gly Gly Leu Lys Ala
170 175 180

Val Val Trp Thr Asp Ala Phe Gln Met Val Val Met Ile Val Gly
185 190 195

Phe Leu Thr Val Leu Ile Gln Gly Ser Thr His Ala Gly Gly Phe
200 205 210

His Asn Val Leu Glu Gln Ser Thr Asn Gly Ser Arg Leu His Ile
215 220 225

Phe Asp Phe Asp Val Asp Pro Leu Arg Arg His Thr Phe Trp Thr
230 235 240

Ile Thr Val Gly Gly Thr Phe Thr Trp Leu Gly Ile Tyr Gly Val
245 250 255

Asn Gln Ser Thr Ile Gln Arg Cys Ile Ser Cys Lys Thr Glu Lys




260 265 270

His Ala Lys Leu Ala Leu Tyr Phe Asn Leu Leu Gly Leu Trp Ile
275 280 285

Ile Leu Val Cys Ala Val Phe Ser Gly Leu Ile Met Tyr Ser His
290 295 300

Phe Lys Asp Cys Asp Pro Trp Thr Ser Gly Ile lO^e Ser Ala Pro
305 310 315

Asp Gln Leu Met Pro Tyr Phe Val Met Glu Ile Phe Ala Thr Met
320 325 330

Pro Gly Leu Pro Gly Leu Phe Val Ala Cys Ala Phe Ser Gly Thr
335 340 345

Leu Ser Thr Val Ala Ser Ser Ile As^ Ala Leu Ala Thr Val Thr
350 355 360

Phe Glu Asp Phe Val Lys Ser Cys Phe Pro His Leu Ser Asp Lys
365 370 375

Leu Ser Thr Trp Ile Ser Lys Gly Leu Cys Leu Leu Phe Gly Val
380 385 390

Met Cys Thr Ser Met Ala Val Ala Ala Ser Val Met Gly Gly Val
395 400 405

Val Gln Ala Ser Leu Ser Ile His Gly Met Cys Gly Gly Pro Met
410 415 420

Leu Gly Leu Phe Ser Leu Gly Ile Val Phe Pro Phe Val Asn Trp
425 430 435

Lys Gly Ala Leu Gly Gly Leu Leu Thr Gly Ile Thr Leu Ser Phe
440 445 450

Trp Val Ala Ile Gly Ala Phe Ile Tyr Pro Ala Pro Ala Ser Lys
455 460 465

Thr Trp Pro Leu Pro Leu Ser Thr Asp Gln Cys Ile Lys Ser Asn
470 475 480

Val Thr Ala Hir Gly Pro Pro Val Leu Ser Ser Arg Pro Gly Ile
485 490 495

Ala Asp Thr Trp Tyr Ser Ile Ser Tyr Leu Tyr Tyr Ser Ala Leu
500 505 510

Gly Cys Leu Gly Cys Ile Val Ala Gly Val Ile Ile Ser Leu Ile
515 520 525

Thr Gly Arg Gln Arg Gly Glu Asp Ile Gln Pro Leu Leu Ile Arg
530 535 540

Pro Val Cys Asn Leu Phe Cys Phe Trp Ser Lys Lys Tyr Lys Thr
545 550 555

Leu Cys Trp Cys Gly Val Gln His Asp Ser Gly Thr Glu Gln Glu
560 565 570

Asn Leu Glu Asn Gly Ser Ala Arg Lys Gln Gly Ala Glu Ser Val
575 580 585

Leu Gln Asn Gly Leu Arg Arg Glu Ser Leu Val His Val Pro Gly
590 595 600

Tyr Asp Pro Lys Asp Lys Ser Tyr Asn Asn Met Ala Phe Glu Thr
605 610 615

Thr His Phe

<210> 13
<211> 631

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7477248CD1 

<400> 13 

Met Glu Arg Gln Ser Arg Val Met Ser Glu Lys Asp Glu Tyr Gln
1 5 10 15

Phe Gln His Gln Gly Ala Val Glu Leu Leu Val Phe Asn Phe Leu

20 25 30

Leu Ile Leu Thr Ile Leu Thr Ile Trp Leu Phe Lys Asn His Arg

35 40 45 

Phe Arg Phe Leu His Glu Thr Gly Gly Ala Met Val Tyr Gly Leu 
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lie 


Met 


Gly Leu 


lie 


Leu Arg Tyr 














Glu 




Gly 


Thr 


vai 


Tyr Asp Cys 










•oU 




Ser 




Leu 


Leu 


vai 


Asn He Thr 














Tyr 


Lys 


Arg 


Glu 


lie 


oer (ain His 










11 u 




Asn 


iiia 


He 


Leu 


Glu 


Lys Met Thr 










125 




Asn 


vaX 


Leu 


Leu 


Pro 


Pro He He. 










140 




Lys 


Lys 


Arg His 


Phe 


Phe GJin Asn 








155 






Phe 


Leu Gly 'Thr Ala He Ser 










170 




rsec 


Tyr 


Gly Phe Val Lys Ala .Met 










185' 




Asn 


Gly 


Asp Phe His Phe Thr Asp 










200 




Met 


Ser 


Ala 


Thr 


Asp 


Pro Val Thr 










215 




lieu 


His 


Val 


Asp. 


Pro 


Asp Leu Tyx 










230 




val 


Leu 


Asn Asp Ala Val Ala He 










245 


1 * 


lie 


Tyr 


Ser 


Pro 


Lys 


Glu Asn Pro 










260 




Phe 


Phe 


Gin 


Ser 


Val Gly Asn Phe 










275 




Phe 


Ala 


Met 


Gly 


Ser 


Ala Tyr Ala 










290 




Lys 


Phe 


Thr 


Lys 


Leu 


Cys Glu Phe 










305 




Phe 


Phe 


Leu 


Leu 


Ser 


Trp Ser Ala 










320 




Gly 


Leu 


Thr Gly 


He 


Val Ala Val 










335 




Ala 


His 


Tyr 


Thr Tyr 


Asn Asn Leu 










350 




Thr 


Lys 


Gin 


Leu 


Phe 


Glu Phe Met 










365 






Phe 


Cys 


Tyr 


Met 


Gly Leu Ala 










380 




ixe 


pne 


Asn 


Ala 


Leu 


Phe He Leu 










395 




vai 


Ala 


Arg 


Ala 


Cys 


Asn He Tyr 










410 




Leu 


Gly 


Arg 


Lys 


Gin Lys He Pro 










425 




Met 


Pne 


Ser 


Gly Leu Arg Gly Ala 


• 








440 




Axg 


TV _ _ 

Asn 


Thr 


Glu 


Ser 


Gin Pro Lys 










455 


Leu 


lieu 


Leu 


Val 


Phe 


Phe Thr Val 










470 




j-niT 


Pro 


Met 


Leu 


Thr 


Trp LeU Gin 










485 


Asp 


Glu 


Asn 


Leu 


Lys 


Glu Asp Pro 










500 


Asn 


Asn 


Leu Asp 


Lys 


Asn Met Thr 










515 




Phe 


Arg 


Met 


T3:p 


Tyr 


Ser Phe Asp 










530 




Leu 


Thr 


His 


Ser 


Gly 


Pro Pro Leu 










545 







55 






• 60 


Ala 


Thr 


Ala 


Pro Thr Asp 


He 




•70 




75 • 


Val 


Lys 


Leu 


Thr Phe Ser 


Pro* 




85 




t 


9q r 


Asp Gin 


Val 


Tyr Glu Tyr 


Lys 




100 






105 


Asn 




Asn 


Pro' His Gin 


Gly 




115 






120 


Phe Asp 


Pro 


Glu He Phe 


Phe 




130 






135 


Phe 


His 


Ala 


Giy Tyr Ser 


Leu 




145 


* 




150 


Leu Gly 


Ser 


He Leu Thr 


'Tyr 




160 






165 


Cys 


He 


Val 


He Gly Leu 


He 




175 






180 


He His 


Ala 


Gly Gin Leu 


Lys 




190 






195 


Cys 


Leu 


Phe 


Phe Gly Ser 


Leu 




205 




210 


Val 


Leu 


Ala 


He Phe His 


Glu 




220 


• 




225 


Thr 


Leu 


Leu 


Phet Gly Glu 


Ser 




235 




240 


Val 


Leu 


Thr 


Tyr Ser He 


Ser 




250 






255 


Asn 


Ala 


Phe 


Asp Ala' Ala 


Ala 




265 






270 


Leu 


Gly 


lie 


Phe Ala 'Gly 


Ser 




280 




285 


He 


lie 


Thr 


Ala Leu Leu 


Thr 




295 






300 


Pro 


Met 


Leu 


Glu Thr Gly 


Leu 




310 




315 


Phe 


Leu 


Ser 


Ala Glu Ala 


Ala , 




3*25 






330 


Leu 


Phe 


Cys 


Gly Val Thr 


Gin 




340 






345 


Ser 


Ser 


Asp 


Ser Lys He 


Arg 




355 






360 


Asn 


Phe 


Leu 


Ala Glu Asn 


Val 




370 






375 


Leu 


Phe 


Thr 


Phe Gin Asn 


His 




3 85 






390 


Gly Ala 


Phe 


Leu Ala He 


Phe 




400 






405 


Pro 


Leu 


Ser 


Phe Leu Leu 


Asn 




415 






420 


Trp Asn 


Phe 


Gin His Met 


Met 




430 






435 


He 


Ala 


Phe 


Ala Leu Ala 


He 




445 






450 


Gin 


Met 


Met 


Phe Thr Thr 


Thr 




460 






465 


Trp 


Val 


Phe 


Gly Gly Gly 


Thr 




475 






480 


He Arg 


Val 


Gly Val Asp 


Leu 




490 






495 


Ser 


Ser 


Gin 


His Gin Glu 


Ala 




505 






510 


Lys 


Ala 


Glu 


Ser Ala Arg 


Leu 




520 






525 


His 


Lys 


Tyr 


Leu Lys Pro 


He 




535 






540 


Thr 


Thr 


Thr 


Leu Pro Glu 


Trp 




550 






555 
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Cys Gly Pro lie Ser Arg Leu Leu Thr Ser Pro Gin Ala Tyr Gly 
560 565 570

Glu Gln Leu Lys Glu Asp Asp Val Glu Cys Ile Val Asn Glii Asp
575 580 585

Glu Leu Ala Ile Asn Tyr Gln Glu Gln Ala Ser Ser Pro Cys Ser

590 595 600

Pro Pro Ala Arg Leu Gly Leu Asp Gln Lys Ala Ser Pro Gln Thr
605 610 615

Pro Gly Lys Glu Asn Ile Tyr Glu Gly Asp Leu Gly Pro Gly Arg
620 625 630

• . ♦ 

<210> 14

<211> 1256

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 2944004CD1 

<400> 14 

Met Asp Arg Glu Glu Arg Lys Thr Ile Asn Gln Gly Gln Glu Asp
1 5 10 15

Glu Met Glu Ile Tyr Gly Tyr Asn Leu Ser Arg Trp Lys Leu Ala
20 25 30

Ile Val Ser Leu Gly Val Ile Cys Ser Gly Gly Val Ser Pro Pro
35 40 45

Pro Leu Tyr Trp Met Pro Glu Trp Arg Val Lys Ala Thr Cys Val
50 55 60

Arg Ala Ala Ile Lys Asp Cys Glu Val Val Leu Leu Arg Thr Thr
65 70 75

Asp Glu Phe Lys Met Trp Phe Cys Ala Lys Ile Arg Val Leu Ser
80 85 90

Leu Glu Thr Tyr Pro Val Ser Ser Pro Lys Ser Met Ser Asn Lys
95 100 105

Leu Ser Asn Gly His Ala Val Cys Leu Ile Glu Asn Pro Thr Glu
110 115 120

Glu Asn Arg His Arg Ile Ser Lys Tyr Ser Gln Thr Glu Ser Gln
125 130 135

Gln Ile Arg Tyr Phe Thr His His Ser Val Lys Tyr Phe Trp Asn
140 145 150

Asp Thr Ile His Asn Phe Asp Phe Leu Lys Gly Leu Asp Glu Gly
155 160 165

Val Ser Cys Thr Ser Ile Tyr Glu Lys His Ser Ala Gly Leu Thr
170 175 180

Lys Gly Met His Ala Tyr Arg Lys Leu Leu Tyr Gly Val Asn Glu
185 190 195

Ile Ala Val Lys Val Pro Ser Val Phe Lys Leu Leu Ile Lys Glu
200 205 210

Val Leu Asn Pro Phe Tyr Ile Phe Gln Leu Phe Ser Val Ile Leu
215 220 225

Trp Ser Thr Asp Glu Tyr Tyr Tyr Tyr Ala Leu Ala Ile Val Val
230 235 240

Met Ser Ile Val Ser Ile Val Ser Ser Leu Tyr Ser Ile Arg Lys
245 250 255

Gln Tyr Val Met Leu His Asp Met Val Ala Thr His Ser Thr Val
260 265 270

Arg Val Ser Val Cys Arg Val Asn Glu Glu Ile Glu Glu Ile Phe
275 280 285

Ser Thr Asp Leu Val Pro Gly Asp Val Met Val Ile Pro Leu Asn
290 295 300

Gly Thr Ile Met Pro Cys Asp Ala Val Leu Ile Asn Gly Thr Cys
305 310 315

Ile Val Asn Glu Ser Met Leu Thr Gly Glu Ser Val Pro Val Thr
320 325 330
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lie Tyr 


Thr 
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He 
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Ser He 
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Asn 






^ 










430 


1 


1 




y| 1 c 




Vol 


i^m vai briy 


Val 


He 


lie 


lie 


Glu. Ser 


Leu Asp 


lie 


lie 
















445 




yi c n 


TxilT 


lie 


f f 17'^ n 7>v*<^ 

inr vai r^ro 


Pro 


Al,a 


Leu 


Fro 


Ala 


Ala 


Met Thr 


Ala 


Gly 






• A t; R 








460 








465 




Val 


lyx Aia v^m 


Arg 


Arg 


Leu 


Lys 


Lys 


116 


Gly lie 


Phe 


Cys 






And 










475 








480 




Ser 


jrro i^in Arg 


lie 


Asn 


He 


Cys 


Gly Gin 


Leu Asn 


Leu 


Val 






yl Q C 










490 








495 


Cys 
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Asp Lys Thr 


Gly Thr 


Leu 


Thr 


Glu Asp 


Gly Leu 


Asp 


Leu 






c n n 










505 






510 


m 

Trp 
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lie Gin Arg 


Val 


Glu 


Asn 


Ala 


Arg 


Phe 


Leu Ser 
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Glu 






DlO 










520 








525 
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Val Cys Asn 


GlU 


Met 


Leu 


Val 


Lys . Ser 


Gin Phe 


Val 


Ala 






CO A 

b3 0 










535 








540 


Cys 


Met 


Ala Thr Cys 


His 


Ser 


Leu 


Thr 


Lys 


He 


Glu Gly 


Val 


Leu 






545 










550 






555 




Gly 


Asp Pro Leu 


Asp 


Leu 


Lys 


Met 


Phe 


Glu 


Ala tie 


Gly 


Trp 






c ^ n 

560 










565 






570 


lie 


Lieu 


T ^ ^ /^1»'^ TV T ~% 

GlU GlU Ala 


Thr 


Glu 


Glu 


Glu 


Thr 


Ala 


Leu His 


Asn 


Arg 






575 










580 








585 


lie 


Met 


Pro Thr Val 


Val 


Arg 


Pro 


Pro 


Lys 


Gin 


Leu Leu 


Pro 


Glu 






• b-y u 
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Ser 


Thr 


Pro Ala Gly 


Asn 


Gin 


Glu 


Met 


Glu 


Leu 


Phe Glu 
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Pro 






dU5 










610 
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TV T -a 

Ala. 


Thr 


Tyr GlU lie 


Gly 


lie 


Val 


Arg 


Gin 


Phe 


Pro Phe 


Ser 


Ser 
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Gin Arg Met 


Ser 


Val 


Val 


Ala 


Arg Val 


Leu Gly 


Asp 


Arg 






0 Jo 
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Asp Ala Tyr 


Met 


Lys 
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% 1 » 
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Pro 


Glu 


Ala He 


Ala 


Gly 
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Leu 




Lys Pro G lu 


Thr 


Val 


Pro 


Val 


Asp 


Phe 


Gin Asn 


Val 


Leu 






665 










670 




Ala Leu 
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«« 

v^xu 


Asp 


Phe Thr Lys 


Gin Gly Phe 


Arg 


Val 


He 


Ala 


His 






680 










685 








690 


Airgf 


Lys 


Leii Gla Ser 


Lys 


Leu 


Thr 


Trp 


His 


Lys 


Val Gin 


Asn 


He 






695 










700 






1 05 


Seir 


Arg Asp Ala lie 


GlU 


Asn 


Asn 


Met 


Asp 


Phe 


Met Gly 


Leu 


He 






710 










715 






^ 0 
720 


ixe 


Met 


Gin Asn Lys 


Leu Lys 


Gin 


Lys 


Thr 


Pro 


Aia vai 


Leu 


Glu 






725 










730 






• 


735 


Asp 


Leu 


His Lys Ala 


Asn lie Arg 


Thr 


Val 


Met 


Val Thr 


Gly 


Asp 






740 
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/50 




Met 


Leu Thr Ala 


Val 


Ser 


Val 


TV 1 — » 

Ala 


Arg Asp 


Cys Gly 


Met 


lie 






755 
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7 ob 


i>eii 


Pro Gin Asp Lys 


Val 


He 


He 


TV 1 •a 

iiia 


Glii 


Ala 


Leu Pro 


Pro 


Lys 
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/ oU 


Asp 


Gly Lys Val Ala 


Lys 


He 


Asn 


Trp 


His 


Tyr 


Ala Asp 


Ser 


Leu 






785 
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7Q ^ 


Thr 


Gin 


Cys Ser His 


Pro 


Ser 


Ala 


He 


Asp 


Pro 


Glu Ala 


He 


Pro 






800 










805 








810 


Val 


Lys 


Leu Val His 


Asp 


Ser 


Leu 


Glu 


Asp 


Leu 


Gin Met 


Thr 


Arg 






815 










820 








825 


Tyr 


His 


Phe Ala Met 


Asn Gly Lys 


Ser 


Phe 


Ser 


val He 


Leu 


Glu 
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830 835 ^40 

His Phe Glti Asp Leu Val Pro Lys Leu Met Leu His Gly Thr Val

845 850 855

Phe Ala Arg Met Ala Pro Asp Gln Lys Thr Gln Leu Ile Glu Ala
860 865 870

Leu Gln Asn Val Asp Tyr Phe Val Gly Met Cys Gly Asp Gly Ala
875 880 885

Asn Asp Cys Gly Ala Leu Lys Arg Ala His Gly Gly Ile Ser Leu

890 895 900

Ser Glu Leu Glu Ala Ser Val Ala Ser Pro Phe Thr Ser Lys Thr
905 910 915

Pro Ser Ile Ser Cys Val Pro Asn Leu Ile Arg Glu Gly Arg Ala
920 925 930

Ala Leu Hie Thr Ser Phe Cys Val Phe Lys Phe Met Ala Leu Tyr
935 940 945

Ser Ile Ile Gln Tyr Phe Ser Val Thr Leu Leu Tyr Ser Ile Leu
950 955 960

Ser Asn Leu Gly Asp Phe Gln Phe Leu Phe Ile Asp Leu Ala Ile
965 970 975

Ile Leu Val Val Val Phe Thr Met Ser Leu Asn Pro Ala Trp Lys
980 985 990

Glu Leu Val Ala Gln Arg Pro Pro Ser Gly Leu Ile Ser Gly Ala
995 1000 1005

Leu Leu Phe Ser Val Leu Ser Gln Ile Ile Ile Cys Ile Gly Phe
1010 1015 1020

Gln Ser Leu Gly Phe Phe Trp Val Lys Gln Gln Pro Trp Tyr Glu
1025 1030 1035

Val Trp His Pro Lys Ser Asp Ala Cys Asn Thr Thr Gly Ser Gly
1040 1045 1050

Phe Trp Asn Ser Ser His Val Asp Asn Glu Thr Glu Leu Asp Glu
1055 1060 1065

His Asn Ile Gln Asn Tyr Glu Asn Thr Thr Val Phe Phe Ile Ser
1070 1075 1080

Ser Phe Gln Tyr Leu Ile Val Ala Ile Ala Phe Ser Lys Gly Lys
1085 1090 1095

Pro Phe Arg Gln Pro Cys Tyr Lys Asn Tyr Phe Phe Val Phe Ser

1100 1105 1110

Val Ile Phe Leu Tyr Ile Phe Ile Leu Phe Ile Met Leu Tyr Pro
1115 1120 1125

Val Ala Ser Val Asp Gln Val Leu Gln Ile Val Cys Val Pro Tyr
1130 1135 1140

Gln Trp Arg Val Thr Met Leu Ile Ile Val Leu Val Asn Ala Phe
1145 1150 1155

Val Ser Ile Thr Val Glu Asn Phe Phe Leu Asp Met Val Leu Trp
1160 1165 1170

Lys Val Val Phe Asn Arg Asp Lys Gln Gly Glu Tyr Arg Phe Ser
1175 1180 1185

Thr Thr Gln Pro Pro Gln Glu Ser Val Asp Arg Trp Gly Lys Cys
1190 1195 1200

Cys Leu Pro Trp Ala Leu Gly Cys Arg Lys Lys Thr Pro Lys Ala
1205 1210 1215

Lys Tyr Met Tyr Leu Ala Gln Glu Leu Leu Val Asp Pro Glu Trp
1220 1225 1230

Pro Pro Lys Pro Gln Thr Thr Thr Glu Ala Lys Ala Leu Val Lys
1235 1240 1245

Glu Asn Gly Ser Cys Gln Ile Ile Thr Ile Thr
1250 1255
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<2il> 499 

<212> PRT 
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<220> 
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<400> 15 . 








Met 


Leu 


His 


Ala 


Leu 


Leu 


1 










Leu 


Leu 


Leu 


Thr 


He 


Cys 










20 




Fxie 


vjj.y 


Tyr Asn 


Leu 


Ser 








35 




Gin 


AalU 


fne 


Thr 


Asn 


Glu 










50 




Leu 


pro 


Asp 


His 


Leu 


Val 




• 






65 




Leu. 


lyr 


Pro 


Leu 


Gly Gly 










80 




Leu 


Ala 


lie 


Thr 


Leu 


Gly 










95 


JLxe 


Pile 


Val 


Val 


•Ser 


Ala 










110 




■» T _ 

Axa 


Gly 


Ser 


Phe Glu Met 










125* 






Asn 


Ala Gly Val 


Ser 










140 




GlU 


Ser 


Ala 


Pro 


Lys 


Glu 










155 




Ala 


lie 


I^he 


Thr 


Ala 


Leu 










17 0 




Leu 


Arg 


Glu 


Leu 


Leu 


Gly 










185 




Ala 


Ser 


Cys 


Leu 


Val 


Pro 










200 




Leu 


Leu 


Pro 


Glu 


Ser 


Pro 










215 




Thr 


Glu 


Ala 


cys 


Leu 


Ala 








230 




Asp 


Leu 


Ala Gly Glu 


Leu 










245 




Cys 


Gin 


Gly Cys Arg 


Ala 










260 




Arg 


n *i _ 
Ala 


Leu Arg Arg 


Gin 










275 




Ala 


Met 


Glu 


Leu 


Cys 


Gly 










290 




Ser 


Val 


Phe 


Arg 


Lys 


Ala 










305 




Ala 


lie 


lie 


Gly Thr 


Gly 










320 




Ser 


Cys 


Val 


Val 


He 


Glu 










335 




Gly 


Gly 


Tyr 


Ser 


Leu 


Met 










350 




TV n -» 
Ala 


Leu 


Cys 


Leu 


Gin 


Ser 










365 




Met 


Ala 


Cys 


He 


Phe 


Ala 










380 




TV T _ 

Ala 


Gly 


Val 


Thr 


Gly 


He 










395 




Ala 


Arg 


Pro 


Ala 


Ala 


Cys 










410 




neu 


Leu 


lie 


Leu 


Val 


Gly 










425 


Leu 


Ser 


His 


Phe 


Leu 


Tyr 










440 


Gly 


Ala 


He 


Tyr 


Thr 


Gly 










455 




Thr 


Phe 


Gin 


Glu 


He 


Ser 










470 




Arg 


Arg 


Ala 


Gin 


Gly 


Pro 










485 
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Arg 


Ser 


Arg 


Thr 


He Gin Gly Arg 


He 






•10 








15 ' 


Ala Ala 


Gly 


He Gly Gly Thr 


Phe 


Gin* 


1 






25 








3q f 


He 


lie 


Asn Ala 


Pro 


Thr Leu 


His 


He 








40 








45 


Thr 


Trp 


Gin 


Ala 


Arg 


Thr*' Gly 


Glu 


Pro 








55 








60 ' 


Leu 


Leu 


Met 


Torp 


Ser 


Leu He 


Val 


Ser 








70 








75 


jjeu 


Pne. 


Gly Ala 


Leu Leu' Ala Gly Pro 






L^Vs 


85 








90 


Ang 


Lys 


Ser 


Leu 


'Leu Val 


Asn 


Asn 








100 








105 


Ala 


He 


Leu 


Phe 


Gly 


Phe Ser Arg Lys 








115 








120 


He 


,Met 


Leu Gly Arg 


Leu Leu Val Gly 








130 








135 


Met 


Asn 


He 


Gin 


Pro 


Met Tyr 


Leu 


Gly 








145 








150 


Leu. Arg 


Gly Ala 


Vai 


Ala Met 


Ser 


Ser 








160 








165 


Gly 


He 


Val 


Met 


Gly 


Gin val 


Vil 


Gly 








175 








180 


Gly Pro 


Gin 


Ala 


Trp 


Pro Leu 


Leu 


Leu 








190 








195 


Gly Ala 


Leu 


Gin 


Leu 


Ala Ser 


Leu 


Pro 








205 








210 


Arg 


Tyr 


Leu 


Leu 


He 


Asp Cys 


'Gly Asp 








220 








225 


Ala 


Leu 


Arg 


Gin 


Leu 


Arg Gly 


Ser 


Gly 








235 








240 


Glu 


Glu 


Leu 


Glu 


Glu 


Glu Arg Ala Ala 








250 








255 


Arg Arg 


Pro 


Trp 


Glu 


Leu Phe 


Gin 


His . 








265 








270 


Val 


Thr 


Ser 


Leu 


Val 


Val Leu 


Gly 


Ser 








280 






285 


Asn Asp 


Ser 


Val 


Tyr 


Ala Tyr 


Ala 


Ser 








295 








300 


Gly Val 


Pro 


Glu 


Ala 


Lys He 


Gin 


Tyr 








310 








315 


Ser 


Cys 


Glu 


Leu 


Leu 


Thr Ala 


Val 


Val 








325 








330 


Arg 


Val 


Gly Arg 


Arg 


Val Leu 


Leu 


He 








340 








345 


Thr 


Cys 


Trp 


Gly 


Ser 


He Phe 


Thr 


Val 








355 








360 


Ser 


Phe 


Pro 


Trp 


Thr 


Leu Tyr 


Leu 


Ala 








370 






375 


Phe 


He 


Leu 


Ser 


Phe 


Gly He 


Gly 


Pro 








3 85 








390 


Leu 


Ala 


Thr Glu Leu Phe Asp Gin Met 








400 








405 


Met 


Val 


Cys 


Gly Ala Leu Met 


Trp 


He 








415 








420 


Leu Gly 


Phe 


Pro 


iPhe 


He Met 


Glu 


Ala 








430 








435 


Val 


Pro 


Phe 


Leu Gly Val Cys Val Cys 








445 








450 


Leu 


Phe 


Leu 


Pro 


Glu 


Thr Lys 


Gly 


Lys 








460 








465 


Lys 


Glu 


Leu 


His 


Arg 


Leu Asn 


Phe 


Pro 








475 








480 


Thr 


Trp 


Arg 


Ser 


Leu Glu Val 


He 


Gin 








490 








495 
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<400> 16 . V 
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Ala 


Ala 


Asn 


Ser. 


Thr 


Ser Asp 


Leu His Thr Pro Gly , Thr 


Gin 


. 1 








5 






10 




15 


Leu 


Ser 


Val 


Ala • 


Asp 


lie 


lie Val 


He -Thr Val Tyr Phe 


Ala 


Leu 










20 






25 




30 


Asn 


Val 


Ala 


Val 


Gly 


He 


T±p Ser 


Ser Cys Arg. Ala Ser Arg Asn 










35 






40 




45 


*Thr Val Asn 


Gly 


Tyr 


Phe 


Leu, Ala 


Gly Arg Asp Met Thr Trp 


Trp 








. - 


50 






55 




60 


Pro 


lie Gly 


Ala 


Ser 


Leu 


Phe Ala 


Ser Ser Glu Gly Ser Gly Leu 










65 






70 




75 


Pile 


lie Gly 


Leu 


Ala 


Gly 


Ser Gly 


Ala Ala Gly Gly Leu Ala Val 










80 




85 . . 




90 


Ala Gly Phe 


Glu 


Trp 


Asn 


Ala Thr 


Tyr Val Leu Leu Ala 


Leu 


Ala 










95 






100 




105 


Trp 


Val 


Phe 


yal 


Pro 


lie 


Tyr lie 


Ser Ser 'Glu. He Val Thr 


Leu 










110 






115 




120 


Pro 


Glu 


Tyr 


lie 


Gin 


Lys 


Arg Tyr 


Gly Gly Gin Arg He Arg Met 










125 






130 1 




135 


Tyr 


Leu 


Ser 


Val 


Leu 


Ser 


Leu Leu 


Leu Ser Val Phe Thr 


Lys 


He 








140 






145 


150 


Ser 


Leu Asp 


Leu 


Tyr 


Ala 


Gly Ala 


Leu Phe Val His lie 


cys 


Leu 










155 






160 


165 


Gly Trp Asn 


Phe 


Tyr 


Leu 


Ser Thr 


He Leu Thr Leu Gly ile 


Thr 










170 






175 




180 


Ala 


Leu 


Tyr 


Thr 


lie 


Ala 


Gly Gly 


Leu Ala Ala Val He 


Tyr 


Thr 










185 






190 




195 


Asp 


Ala 


Leu 


Gin 


Thr 


Leu 


He Met 


Val Val Gly Ala Val 


He 


Leu 










200 






205 




210 


Thr 


He 


Lys 


Ala 


Phe 


Asp 


Gin He 


Gly Gly Tyr Gly Gin 


Leu 


Glu 










215 






220 




225 


Ala 


Ala 


Tyr 


Ala 


Gin 


Ala 


He Pro 


Ser Arg Thr lie Ala 


Asn 


Thr 










230 






235 




240 


Thr 


Gys 


His 


Leu 


Pro 


Arg 


Thr Asp 


Ala Met His Met Phe 


Arg 


Asp 










245 






250 




255 


.Pro 


His 


Thr 


Gly Asp 


Leu 


Pro Trp 


Thr Gly Met Thr Phe 


Gly 


Leu 










260 






265 




270 


Thr 


lie 


Met 


Ala 


Thr 


Trp 


Tyr Trp 


Cys Thr Asp Gin Val 


lie 


Val 










275 






280 




285 


Gin 


Arg 


Ser 


Leu 


Ser 


Ala Arg Asp 


Leu Asn His Ala Lys 


Ala 


Gly. 










290 






295 




300 


Ser 


lie 


Leu 


Ala 


Ser 


Tyr 


Leu Lys 


Met Leu Pro Met Gly Leu 


lie 










305 


310 




315 


lie 


Met 


Pro 


Gly Met 


He 


Ser Arg 


Ala. Leu Phe Pro Asp 


Asp 


Val 








320 






325 


330 


Gly cys Val 


Val 


Pro 


Ser 


Glu Cys 


Leu Arg Ala Cys Gly Ala 


Glu 










335 






340 




345 


Val 


Gly cys 


Ser 


Asn 


ile Ala Tyr 


Pro Lys Leu Val Met 


Glu 


Leu 










350 






355 




360 


Met 


Pro 


lie 


Gly Leu 


Arg Gly Leu 


Met He Ala Val Met 


Leu 


Ala 










365 






370 




375 


Ala 


Leu 


Met 


Ser 


Ser 


Leu 


Thr Ser 


He Phe Asn Ser Ser 


Ser 


Thr 










380 






385 




390 


Leu 


PHe 


Thr 


Met 


Asp 


He Trp Arg 


Arg Leu Arg Pro Arg 


Ser 


Gly 










395 






400 




405 
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Glu Arg 


Glu Leu 


Leu 


Leu 


Val Gly Arg Leu Val 


He Val 


Al^. 


Leu 






A 1 fx 






415 






420 


He 61y 


Val Ser 


Val 


Ala Trp lie Pro Val Leu Gin Asp 


Ser 


Asn 






/IOC 






43 0 








435 


Ser Gly 


Gin* Leu' 


Pne 


He Tyr Met. Gin 


Ser 


Val 


Thr S^r 


Ser 


Leu 






A A Ci 






445 








450 


Aid Pro 


Jtrro vaji 


Thr 


Ala 


Vai Phe Val 


Leu 


Gly Val Phe 


Trp. Arg 






/IRC 






460 








465 


ATQ Ala. 


Asn Glu 


C:>j.n 


Gly Ala Phe Trp Gly Leu 


tie Ala 


Gly Leu 






4 / U 






475 








480 


VaX VSIX 


Gly Ala 


Thr 


Arg Leu Val Leu Glu Phe Leu Asn 


Pro 


Ala 






yl Q c 






490 








495 


iriO iriO 


cys GJLy 


Glu 


Pr6 Asp Thr Arg 


Pro 


Ala Val Leu 


Gly 


Ser 












505 




• 


510 


xJLe nxS 


ryr jjeu 


nlS 


Phe Ala Val Ala 


Leu 


Phe 


Ala Leu 


Ser 


Gly 




vai vsljl 


CI c 






520 








525 


AJLa vail 


Jiia 


Gly Ser Leu. Leu 


Thr 


Pro 


Pro Pro 


Gin 


Ser 






CI ft 






535 






540 


|Va.X GxT\ 


I 1 e Giu 


Asn 


Leu Thr Trp Trp Thr Leu Ala Gin 


Asp 


Val 


1 




C A C 

545 






550 






555 


Pro Leu 


Gly Thr 


Lys 


Ala Gly Asp Gly Gin 


Thr 


Pro Gin 




nxs 






560 






565 






570 


Ala Phe 


Trp Ala 


Arg 


Val 


Cys Gly Phe 


Asn 


Ala 


He Leu 


Leu 


Met 






575 






580 








585 


Cys val 


Asn He 


Phe 


Phe Tyr Ala Tyr 


Phe 


Ala 












590 






595 
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<400> 17 


















Met Asp 


Cys Ser 


Leu 


Val 


Arg Thr Leu 


Val 


His 


Arg Tyr 


Cys 


Ala 


1 




. 5 






10 






15 


Gly Glu 


Glu Asn 


Trp 


Val 


Asp Ser Arg 


Thr 


He 


Tyr Val 


Gly 


His 






20 






25 






30 


Arg Glu 


Pro Pro 


Pro 


Gly 


Ala Glu Ala 


Tyr 


He 


Pro Gin 


Arg 


Tyr 






35 






40 






45 


Pr6 Asp 


Asn Arg 


He 


Val 


Ser Ser Lys 


Tyr 


Thr 


Phe Trp 


Asn 


Phe 






50 






55 






60 


He Pro 


Lys Asn 


Leu 


Phe 


Glu Gin Phe 


Arg Arg Val Ala 


Asn 


Phe 






65 






70 








75 


•Tyr Phe 


Leu lie 


He 


Phe 


Leu Val Gin 


Leu 


He 


He Asp 


Thr 


Pro 






80 






85 






90 


Thr Ser 


Pro Vai 


Thr 


Ser 


Gly Leu Pro 


Leu 


Phe 


Phe Val 


He 


Thr 






95 






100 








105 


Val Thr 


Ala He 


Lys 


Gin 


Gly Tyr Glu Asp Trp Leu Arg 


His 


Lys 






110 






115 








120 


Ala Asp 


Asn Ala 


Met 


Asn 


Gin cys Pro 


Val 


His 


Phe He 


Gin 


His 






125 






130 








135 


Gly Lys 


Leu Val Arg 


Lys 


Gin Ser Arg 


Lys 


Leu Arg Val 


Gly Asp 




Met Val 


140 






145 








150 


He Val 


Lys 


Glu 


Asp Glu Thr 


Phe 


Pro 


Cys Asp 


Leu 


He 






155 






160 






165 


Phe Leu 


Ser Ser 


Asn 


Arg 


Gly Asp Gly Thr 


Cys 


His Val 


Thr 


Thr 






170 






175 






180 


Ala Ser 


Leu Asp Gly 


Glu 


Ser Ser His 


Lys 


Thr 


His Tyr 


Ala 


Val 






185 






190 






195 


Gin Asp 


Thr Lys 


Gly 


Phe 


His Thr Glu 


Glu 


Asp He Gly 


Gly Leu 






200 






205 








210 


His Ala 


Thr He 


Glu 


Cys 


Glu Gin Pro 


Gin 


Pro 


Asp Leu 


Tyr 


Lys 






215 






220 




225 






Phe Val Gly Arg Ile Asn Val Tyr Ser Asp Leu Asn Asp Pro Val

230 235 240

Val Arg Pro Leu Gly Ser Glu Asn Leu Leu Leu Arg Gly Ala Thr

245 250 255

Leu Lys Asn Thr Glu Lys Ile Phe Gly Val Ala Ile Tyr Thr Gly

260 265 270

Met Glu Thr Lys Met Ala Leu Asn Tyr Gln Ser Lys Ser Gln Lys

275 280 285

Arg Ser Ala Val Glu Lys Ser Met Asn Ala Phe Leu Ile Val Tyr

290 295 300

Leu Cys Ile Leu Ile Ser Lys Ala Leu Ile Asn Thr Val Leu Lys

305 310 315

Tyr Val Trp Gln Ser Gl^ Pro Phe Arg Asp Glu Pro Trp Tyr Asn

320 325 330

Gln Lys Thr Glu Ser Glu Arg Gln Arg Asn Leu Phe Leu Lys Ala

335 340 345

Phe Thr Asp Phe Leu Ala Phe Met Val Leu Phe Asn Tyr Ile Ile
350 355 360

Pro Val Ser Met Tyr Val Thr Val Glu Met Gln Lys Phe Leu Gly

365 370 375

Ser Tyr Phe Ile Thr Trp Asp Glu Asp Met Phe Asp Glu Glu Thr

380 385 390

Gly Glu Gly Pro Leu Val Asn Thr Ser Asp Leu Asn Glu Glu Leu

395 400 405

Gly Gln Val Glu Tyr Ile Phe Thr Asp Lys Thr Gly Thr Leu Thr
410 415 420

Glu Asn Asn Met Glu Phe Lys Glu Cys Cys Ile Glu Gly His Val

425 430 435

Tyr Val Pro His Val Ile Cys Asn Gly Gln Val Leu Pro Glu Ser

440 445 450

Ser Gly Ile Asp Met Ile Asp Ser Ser Pro Ser Val Asn Gly Arg

455 460 465

Glu Arg Glu Glu Leu Phe Phe Arg Ala Leu Cys Leu Cys His Thr

470 475 480

Val Gln Val Lys Asp Asp Asp Ser Val Asp Gly Pro Arg Lys Ser

485 490 495

Pro Asp Gly Gly Lys Ser Cys Val Tyr Ile Ser Ser Ser Pro Asp

500 505 510

Glu Val Ala Leu Val Glu Gly Val Gln Arg Leu Gly Phe Thr Tyr

515 520 525

Leu Arg Leu Lys Asp Asn Tyr Met Glu Ile Leu Asn Arg Glu Asn

530 535 540

His Ile Glu Arg Phe Glu Leu Leu Glu Ile Leu Ser Phe Asp Ser

545 550 555

Val Arg Arg Arg Met Ser Val Ile Val Lys Ser Ala Thr Gly Glu

560 565 570

Ile Tyr Leu Phe Cys Lys Gly Ala Asp Ser Ser Ile Phe Pro Arg

575 580 585

Val Ile Glu Gly Lys Val Asp Gln Ile Arg Ala Arg Val Glu Arg

590 595 600

Asn Ala Val Glu Gly Leu Arg Thr Leu Cys Val Ala Tyr Lys Arg

605 610 615

Leu Ile Gln Glu Glu Tyr Glu Gly Ile Cys Lys Leu Leu Gln Ala

620 625 630

Ala Lys Val Ala Leu Gln Asp Arg Glu Lys Lys Leu Ala Glu Ala

635 640 645

Tyr Glu Gln Ile Glu Lys Asp Leu Thr Leu Leu Gly Ala Thr Ala

650 655 660

Val Glu Asp Arg Leu Gln Glu Lys Ala Ala Asp Thr Ile Glu Ala

665 670 675

Leu Gln Lys Ala Gly Ile Lys Val Trp Val Leu Thr Gly Asp Lys

680 685 690

Met Glu Thr Ala Ala Ala Thr Cys Tyr Ala Cys Lys Leu Phe Arg

695 700 705

Arg Asn Thr Gln Leu Leu Glu Leu Thr Thr Lys Arg Ile Glu Glu

710 715 720

Gln Ser Leu His Asp Val Leu Phe Glu Leu Ser Lys Thr Val Leu
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725 730 . 735 

Arg His Ser Gly Ser Leu Thr Arg Asp Asn Leu Ser Gly Leu Ser
740 745 750

Ala Asp Met Gln Asp Tyr Gly Leu Ile Ile Asp Gly Ala Ala Leu
755 760 765

Ser Leu Ile Met Lys Pro Arg Glu Asp Gly Ser Ser Gly Asn Tyr
770 775 780

Arg Glu Leu Phe Leu Glu Ile Cys Arg Ser Cys Ser Ala Val Leu
785 790 795

Cys Cys Arg Met Ala Pro Leu Gln Lys Ala Gln Ile Val Lys Leu
800 805 810

Ile Lys Phe Ser Lys Glu His Pro Ile Thr Leu Ala Ile Gly Asp
815 820 825

Gly Ala Asn Asp Val Ser Met Ile Leu Glu Ala His Val Gly Ile
830 835 840

Gly Val Ile Gly Lys Glu Gly Arg Gln Ala Ala Arg Asn Ser Asp
845 850 855

Tyr Ala Ile Pro Lys Phe Lys His Leu Lys Lys Met Leu Leu Val
860 865 870

His Gly His Phe Tyr Tyr Ile Arg Ile Ser Glu Leu Val Gln Tyr
875 880 885

Phe Phe Tyr Lys Asn Val Cys Phe Ile Phe Pro Gln Phe Leu Tyr
890 895 900

Gln Phe Phe Cys Gly Phe Ser Gln Gln Thr Leu Tyr Asp Thr Ala
905 910 915

Tyr Leu Thr Leu Tyr Asn Ile Ser Phe Thr Ser Leu Pro Ile Leu
920 925 930

Leu Tyr Ser Leu Met Glu Gln His Val Gly Ile Asp Val Leu Lys
935 940 945

Arg Asp Pro Thr Leu Tyr Arg Asp Val Ala Lys Asn Ala Leu Leu
950 955 960

Arg Trp Arg Val Phe Ile Tyr Trp Thr Leu Leu Gly Leu Phe Asp
965 970 975

Ala Leu Val Phe Phe Phe Gly Ala Tyr Phe Val Phe Glu Asn Thr
980 985 990

Thr Val Thr Ser Asn Gly Gln Ile Phe Gly Asn Trp Thr Phe Gly
995 1000 1005

Thr Leu Val Phe Thr Val Met Val Phe Thr Val Thr Leu Lys Leu
1010 1015 1020

Ala Leu Asp Thr His Tyr Trp Thr Trp Ile Asn His Phe Val Ile
1025 1030 1035

Trp Gly Ser Leu Leu Phe Tyr Val Val Phe Ser Leu Leu Trp Gly
1040 1045 1050

Gl'i' Val Ile Trp Pro Phe Leu Asn Tyr Gln Arg Met Tyr Tyr Val
1055 1060 1065

Phe Ile Gln Met Leu Ser Ser Gly Pro Ala Trp Leu Ala Ile Val
1070 1075 1080

Leu Leu Val Thr Ile Ser Leu Leu Pro Asp Val Leu Lys Lys Val
1085 1090 1095

Leu Cys Arg Gln Leu Trp Pro Thr Ala Thr Glu Arg Val Gln Gln
1100 1105 1110

Asn Gly Cys Ala Gln Pro Arg Asp Arg Asp Ser Glu Phe Thr Pro

1115 1120 1125

Leu Ala Ser Leu Gln Ser Pro Gly Tyr Gln Ser Thr Cys Pro Ser
1130 1135 1140

Ala Ala Trp Tyr Ser Ser His Ser Gln Gln Val Thr Leu Ala Ala
1145 1150 1155

Trp Lys Glu Lys Val Ser Thr Glu Pro Pro Pro Ile Leu Gly Gly
1160 1165 1170

Ser His His His Cys Ser Ser Ile Pro Ser His Ser Cys Pro Arg
1175 1180 1185

Ser Arg Val Gly Met Leu Val
1190
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<213> Homo sapiens 

<220>

<221> misc_feature

<223> Incyte ID No: 7474127CD1 

<400> 18 



Met 


Gly 


Lys' 


He 


Glu 


Asn 


Asn Glu 


n 

JL 








5 








mr 


Arg His 


Glu Thr 


Tyr Arg 










20 








Tnr 


Arg 


Leu 


Ala 


Leu 


Leu Ala 










35 






Cys 


Lieu 


Thr 


Thr 


Ala 


Gly 


Asp Lys 










50 






T All 

lieu 


Ser 


Pro 


Pro 


Pro 


Arg 


Ala Pro 










* .65 








Cys 


Phe 


Glu 


Gly 
80, 


•Gly 


Ala Gly 


Arg 


Ala 


Ser 


Asp 


His 


Pro 


Gly Gly 




■ 






95 






Arg 


His 


Pro Gly Val Phe 


Ala Tyr 










110 




Gly 


Lys 


Leu 


His 


Cys 


Pro 


Ala Asp 










125 






Glti 


Glu 


Leu 


Ala 


Phe 


Trp 


Gly lie 










•140 


Cys 


Cys 


Trp 


Met 


Thr 


Tyr 


Arg Gin 










155 






Leu 


Asp 


lie 


Phe 


Glu .Thr 


Pro Asp: 










17 o; 






Asp 


Asp 


Glu 


Asp 


Leu 


Ala 


Ala Lys 










185 




Ala 


Gly 


Leu Gly Gly 


Pro 


Asp Gly 










200 






Leu 


Gin 


Pro 


Arg 


Met 


Trp 


Ala Leu 










215 






Arg 


Ala 


Ala 


Arg 


Phe 


He 


Ala Phe 










230 






Val 


Ser 


He 


Thr 


Thr 


Phe 


Cys Leu 










245 




lie 


Val 


Lys 


Asn 


Lys 


Thr 


Glu Pro 










260 






Val 


Leu 


Gin 


Tyr 


Glu 


lie 


Glu Thr 










275 






Glu 


Gly 


Val 


Cys 


Val 


Val 


Trp Phe' 










290 






lie 


Val 


Phe 


Ser 


Pro 


Asn 


Lys Leu 










305 




Asn 


lie 


He 


Asp 


Phe 


Val 


Ala He 










320 








Leu 


Ser 


Gly 


Leu 


Ser 


Ser Lys 










335 




irxie 


Leu 


Arg Val 


Val 


Arg 


Pile Val 










350 






Leu 


Thr 


Arg His 


Phe 


Val 


Gly Leu 










365 






Arg 


Ala 


Ser 


Thr 


Asn 


Glu 


Phe Leu 








380 






Leu 


Gly 


Val 


Leu 


He 


Phe 


Ala Thr 










395 






\Zal 


Gly 


Ala 


Gin 


Pro 


Asn 


Asp Pro 








410 




Phe 


Lys 


Asn 


He 


Pro 


He 


Gly Phe 










425 








Thr 


Leu Gly Tyr 


Gly 


Asp Met 



440 



Arg 


Val 


He Leu Asn 


Val 


Gly 




10 








15 


Ser 


Thr 


Leu 


Lys Thr 


Leu 


Pro 




25 






30 


Ser 


Ser 


Glu 


Pro, Pro 


Gly 


Asp 












45 


L^U 


Gin 


Pro 


, Ser Pro 


Pro 


Pro 




55 








60 


Pro 


Leu 


Ser 


Pro Gly 


Pro 


Gly 




70 








75 


Asn 


Cys 


Ser 


Ser Arg 


Gly 


Gly 




85 








90 


Gly 


Arg 


Glu 


phe Phe 


Phe 


T^p 




100 








105 


Val' 


Leu 


Asn 


Tyr Tyr 


Arg 


Thr 




115 








120 


Val 


Cys 


Gly 


Pro Leu Phe 


Glu 




130 








135 


Asp 


Glu 


Thr 


Asp Val 


Glu 


Pro 




145 








150 


His 


Arg 


Asp 


Ala Glu, 


,Qlu 


Ala 




160 








165 


Leu 


He 


Gly 


Gly Asp 


Pro 


Gly 


Arg 


175 








180 


Leu Gly 


He Glu 


Asp 


Ala 




.190 






195 


Lys 


Ser 


Gly 


Arg Trp 


Arg 


Arg 




205 








210 


Phe 


Glu 


Asp 


Pro Tyr 


ser 


S0r 




2,20 




225 


Ala 


Ser 


Leu 


Phe Phe 


He 


Leu 




235 








240 


Glu 


Thr 


His 


Glu Ala 


Phe 


Asn 




250 








255 


Val 


He 


Asn 


Gly Thr 


Ser 


Val 




26.5 








270 


Asp 


Pro 


Ala 


Leu Thr Tyr 


Val 




280 








285 


Thr 


Phe 


Glu 


Phe Leu 


Val 


Arg 




295 








300 


Glu 


Phe 


He 


Lys Asn 


Leu 


Leu 




310 








315 


Leu 


Pro 


Phe 


Tyr Leu Glu 


Val 




325 








330 


Ala 


Ala Lys 


Asp Val 


Leu 


Gly 




340 








345 


Arg 


He 


Leu 


Arg ile 


Phe 


Lys 




355 








360 


Arg 


Val 


Leu 


Gly His 


Thr 


Leu 




370 








375 


Leu 


Leu 


He 


lie Phe 


Leu 


Ala 




385 








390 


Met 


He 


Tyr 


Tyr Ala Glu 


Arg 




400 








405 


Ser 


Ala 


Ser 


Glu His. 


Thr 


Gin 




415 








420 


Trp 


Trp 


Ala 


Val Val 


Thr 


Met 




43 0 








435 


Tyr 


Pro 


Gin 


Thr Trp 


Ser 


Gly 




445 








450 
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Met 


Lgu 


Val Gly 


Ala Leu 


Cys 


Ala 






455 






Ala 


Met 


Pro Val 


Pro* Val He' 


•Val 








470 






Ser 


Leu 


Ala Met 


Ala Lys 


Gin Lys 








485 






His 


lie 


Prot Pro 


Ala Pro 


Gin 


Ala 








500 






Thr 


Glu 


Leu Asn 


Met Ala 


cys 


Asn 








515 . 




Leu 


Gly 


Lys Asp 


Asn Arg 


L.eu 


Leu 








530 






Ser 


Gly 


Asp Asp 


Ser Thr 


Gly Ser 






545 






GlU 


Arg 


Leu Pro 


He Arg 


Arg 


Ser 








.560 






Arg 


Gly 


Glu Thr 


Cys Phe 


Leu 


Leu 








575* 






Ala 


Ser 


Asp Gly 


Gly' He 


Arg 


Lys 








590 






Leu 


Asn 


Aen lie 


Ala Gly 


Leu. 


Ala 








605 






Pro 


Val 


Thr Ser 


Pro Tyr 


Asn 


Ser 






620 






Arg 


Ser 


Pro He 


Pro Ser 


He 


Leu 








635 
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<221> misc_feature 

<223> Incyte ID No: 7476949CD1 

<400> 19 



Met 


Ser Lys Asp 


Leu 


Ala 


Ala 


Met 


1 


5 








Gly 


Val Arg Thr 


Glu 


Thr 


Ala 


Pro 






20 








Val 


Gly Leu His 


Ala 


Tyr Asp 


He 






35 








Val 


Phe Val lie 


Ala 


Val Gly He 






50 








Arg 


Gly Thr He 


Gly Gly Tyr 


Phe 






65 








Trp 


Trp Pro He 


Gly 


Ala 


Ser 


Leu 






80 








Gly 


Leu Phe He 


Gly 


Leu 


Ala 


Gly 






95 








Ala 


Val Gly Gly 


Phe 


Glu 


Trp 


Asn 






110 








Leu 


Gly Trp Val 


Phe 


Val 


Pro 


Val 






125 








Thr 


Met Pro Gin 


Tyr 


Leu 


Lys 


Lys 






140 








Gin 


Val Tyr Met 


Ser 


Val 


Leu 


Ser 




155 








Lys 


He Ser Thr 


Asp 


He 


Phe 


Ser 






170 








Ala 


Leu Gly Trp 


Asn 


Leu Tyr 


Leu 




185 








Val 


Thr Ala Val 


Tyr 


Thr 


He 


Ala 






200 








Tyr 


Thr Asp Ala 


Leu 


Gin 


Thr 


Val 



215 



Leu Ala Gly Val Leu Thr Ile 
460 465 

Asn Asn Phe Gly Met T^x Tyr 
475 480 

Leu Pro Arg Lys Arg Lys Lys 
490 495 

Ser Ser Pro Thr Phe Cys Lys 
505 510 

Ser Thr Gln Ser Asp Thr Cys 
520 525 

Glu His Asn Arg Ser Val Leu 
535 540 

Glu Pro Pro Leu Ser Pro Pro 
550 555 

Ser Thr Arg Asp Lys Asn Arg 
565 570 

Thr Thr Gly Asp Tyr Thr Cys 
580 585 

Gly Tyr Glu Lys Ser Arg Ser 
595 600 

Gly Asn Ala Leu Arg Leu Ser 
610 615 

Pro Cys Pro Leu Arg Arg Ser 
625 630 



Gly Pro Gly Ala Ser Gly Asp 




10 




IS 


His 


He 


Ala Leu Asp Ser 


Arg 




25 




30 


Ser 


Val 


Val. Val He Tyr 


Phe 




40 


45 


Trp 


Ser 


Ser He Arg Ala 


Ser 




55 




60 


Leu 


Ala 


Gly Arg Ser Met 


Ser 




70 




75 


Met 


Ser 


Ser Asn Val Gly 


Ser 




85 




90 


Thr 


Gly 


Ala Ala Gly Gly Leu 




IDO 




105 


Ala 


Thr 


Trp Leu Leu Leu 


Ala 




115 




120 


Tyr 


He. 


Ala Ala Gly Val 


Val 




13 0 




135 


Arg 


Phe 


Gly Gly Gin Arg 


He 




145 




150 


Leu 


He 


Leu Tyr He Phe 


Thr 




160 


165 


Gly Ala 


Leu Phe He Gin Met 




175 




180 


Ser 


Thr 


Gly He Leu Leu 


Val 




190 


195 


Gly Gly 


Leu Met Ala Val 


He 




205 




210 


He 


Met 


Val Gly Gly Ala 


Leu 




220 




225 
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Val 


Leu 


Met Phe 


Leu 


Gly Phe Gin 








230 


Leu 


Glu 


Gin Arg 


Tyr 


Arg Gin Ala 








245 




Asn 


Thr 


Thr Cys 


His 


Leu Pro Arg^ 








260 


Arg 


Asp 


Pro Val 


Ser 


Gly Asp He 








275 




Gly 


Leii 


Thr Val 


Leu 


Ala Thr Trp 






290 


He 


Val 


Gin Arg 


Ser 


Leu Ser Ala 








305 




Gly 


Gly Ser Val 


Leu. 


Gly Gly Tyr 








320 




Phe 


He 


Val Met 


Pro 


Gly Met He. 








335 




Glu 


Val 


Gly Cys 


Val 


Asp Pro Asp 








350 




Ala 


Arg Val Gly Cys 


Ser Asn He 


1 






365 




Alai 


Leu 


Met Pro Val 


Gly Leu Arg 








380 


Met 


Ala 


Ala Leu 


Met 


Ser Ser Leu 








395 




Ser 


Thr 


Leu Phe 


Thr 


He Asp Val 








410 


Ser 


Thr 


Glu Gin 


Glu 


Leu Met Val 








425 




Phe 


Leu 


Val Val 


He 


Ser He Leu 








440 




Ser 


Asn 


Ser Gly 


Gin 


Leu Phe Asp 








455 


Tyr 


Leu 


Ala Pro 


Pro 


He Thr Ala 








470 




Cys 


Lys Arg Val 


Thr 


Glu Pro Gly 








485 




Gly 


Leu Gly Val 


Gly 


Leu Leu Arg 








500 




Pro 


Ala 


Pro Ala 


Cys 


Gly Glu Val 








515 


hys 


Asp 


Phe His 


Tyr 


Leu Tyr Phe 








53 0 




Thr 


Ala 


He Val 


He 


Val He Val 


• 






545 




Pro 


Glu Glu Glh 


Leu 


Thr Arg Leu 








560 




Pro 


Leu 


Ser Glu 


Leu 


Glu Lys Glu 








575 


lie 


Ser Glu Arg 


Pro 


Ala Gly Glu 








590 




Ala 


Glu 


-Asn Ser 


Ser 


Leu Gly Gin 








605 




Arg 


Ser 


Trp Gly 


Lys 


Leu Leu Trp 








620 




Gly 


Thr 


Pro Glu 


Gin 


Ala Leu Ser 


Glu 






635 




Gin 


Lys Leu 


Thr 


Ser He Glu 








650 




Val 


Cys 


Asn He 


Asn 


Ala Val Leu 






665 




Leu 


Trp 


Gly Tyr 


Phe 


Ala 








680 





<210> 20 

<211> 1096 

<212> PRT 

<213> Homo sapiens 



Asp. 


Val Gly Trp Tyr Pro 


Gly 




235 


240 


He 


Pro Asn Val Thr Val 


Pro 




250 


255 


Pro Asp Ala Phe His He 

265 " .: • V 


Leu 


-270 


Pro 


Trp Pro Gly Leu He 


Phe 




280 • • 


285 


Cys 


Trp Cys Thr Asp Gin Val 




295 


300 


Lys 


Ser Leu Ser His Ala 


Lys 




310 


315 


Leu Lys He Leu Pro Met Phe 




325 , , . 


330 


Ser 


Arg Ala Leu Phe Pro 


Asp 




340 


345 


Val 


Cys Gin Arg He Cys 


Gly 




355 


360 


Ala Tyr Pro Lys Leu Val 


Met 




370 


375 


Gly Leu Met He Ala Val 


He 




385 


390 


Thr 


Ser He Phe Asn Ser 


Ser 




400 


405 


Tm 


Gin Arg Phe Arg Arg 


Lys 




415 


420 


Val 


Gly Arg Val Phe Val 
430* 


Val 




435 


Tm 


He Pro He He Gin 


Ser 




445 


450 


Tyr 


He Gin Ala Val Thr 


Ser 




460 


465 


Leu 


Phe Leu Leu Ala He 


Phe 




475 


480 


Ala 


Phe Trp Gly Leu Val 


Phe 




490 


495 


Met 


He Leu Glu Phe Ser 


Tyr 




505 


510 


Asp 


Arg Arg Pro Ala Val 


Leu 




520 


525 


Ala 


He Leu Leu Cys Gly Leu 




535 


540 


Ser 


Leu Cys Thr Thr Pro 


He 




550 


555 


Thr 


Trp Trp Thr Arg -Asn Cys 




565 


570 


Ala 


His Glu Sfer Thr Pro 


Glu 




580 


585 


Cys 


Pro Ala Gly Gly Gly Ala 




595 


600 


Glu 


Gin Pro Glu Ala Pro 


Ser 




610 


615 


Ser 


Trp Phe cys Gly Leu 


Ser 




625 


63 0 


Pro 


Ala Glu Lys Ala Ala 


Leu 




-640 


645 


Glu 


Glu Pro Leu Trp Arg 


His 




655 


660 


Leu 


Leu Ala He Asn He 


Phe 




670 


675 
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Met Trp Arg Trp Ile Arg Gln Gln Leu Gly Phe Asp Pro Pro His 
1 5 10 15 

Gln Ser Asp Thr Arg Thr Ile Tyr Val Ala Asn Arg Phe Pro Gln 
20 25 30 

Asn Gly Leu Tyr Thr Pro Gln Lys Phe Ile Asp Asn Arg Ile Ile 
35 40 45 

Ser Ser Lys Tyr Thr Val Trp Asn Phe Val Pro Lys Asn Leu Phe 
50 55 60 

Glu Gln Phe Arg Arg Val Ala Asn Phe Tyr Phe Leu Ile Ile Phe 
65 70 75 

Leu Val Gln Leu Met Ile Asp Thr Pro Thr Ser Pro Val Thr Ser 
80 85 90 

Gly Leu Pro Leu Phe Phe Val Ile Thr Val Thr Ala Ile Lys Gln 
95 100 105 

Gly Tyr Glu Asp Trp Leu Arg His Asn Ser Asp Asn Glu Val Asn 
110 115 120 


Gly 


Ala Pro Yal Tyr Val Val Arg Ser Gly Gly Leu Val Lys 


XIJIX. 










125 








130 






135 


Arg 


Ser 


Lys 


Asn 


He Arg Val 


Gly Asp 


He 


Val Arg He 


Ala 


T Arc 










140 








145 






150 


Asp. 


Glu 


He 


irne 


Pro 


Ala Asp 


Leu Val 


Leu 


Leu Ser Ser Asp 


Arg 








155 








160. 






165 


Leu 


Asp Gly 




Cys 


His 


Val 


Thr Thr Ala 


Ser Leu Asp. 


Gly 


Glu 










170 








175 






180 


Thr 


Asn 


Leu 


Lys 


Thr 


His 


Val 


Ala Val 


Pro 


Glu Thir Ala 


Leu 


Leu 








18S 








190 






195 


Gin 


Thr 


Val 




Asn 


Leu 


Asp 


Thr Leu 


Val 


Ala Val He 


Glu 


Cys 










200 






205 






210 


Gin 


Gin 


Pro 




Ala 


Asp 


Leu 


Tyr Arg 


Phe 


Met Gly Arg 


Met 


lie 










215 








220 






225 


lie 


Thr 


Gin 


Glh 


Met 


Glu 


Glu 


He Val 


Arg 


Pro Leu Gly 


Pro 


Glu 










230 








235 






240 


Ser 


Leu 


Leu 


Leu 


Arg 


Gly 


Ala 


Arg Leu 


Lys 


Asn Thr Lys 


Glu 


He 










245 








250 






255 


Phe 


Gly Val 


Ala 


Val 


Tyr 


Thr 


Gly Met 


Glu 


Thr Lys Met 


Ala 


Leu 










260 








2 65 






270 


Asn 


Tyr 


Lys 


Ser 


Lys 


Ser 


Gin 


Lys Arg 


Ser 


Ala Val Glu 


Lys 


Ser 










275 








280 






285 


Met 


Asn 


Thr 


Phe 


Leu 


He 


He 


Tyr Leu 


Val 


He Leu He 


Ser 


Glu 










290 








295 






300 


Ala 


Val 


He 


Ser 


Thr 


He. 


Leu 


Lys Tyr 


Thr 


Trp Gin Ala 


Glu 


Glu 










305 








310 






315 


liys 


Trp 


Asp 


Glu 


Pro Trp 


Tyr Asn Gin Lys 


Thr Glu His 


Gin 


Arg 










320 








325 






330 


Asn 


Ser 


Ser 


Lys 


Val 


Glu Tyr Val. Phe Thr Asp Lys Thr 


Gly Thr 










335 








340 






345 


Leu 


Thr 


Glu 


Asn 


Glu 


Met 


Gin. Phe Arg Glu Cys Ser He Asn Gly 










350 








355 






360 


Met 


Lys 


Tyr 


Gin 


Glu 


He Asn Gly Arg 


Leu 


Val Prp Glu Gly Pro 










365 








370 






375 


Thr 


Pro 


Asp 


Ser 


Ser 


Glu Gly Asn Leu 


Ser 


Tyr Leu Ser 


Ser 


Leu 








.380 








385 






390 


Ser 


His 


Leu 


Asn 


Asri 


Leu 


Ser 


His Leu 


Thr 


Thr Ser Ser 


Ser 


Phe 










395 








400 






405 


Arg 


Thr 


Ser 


Prp 


Glu 


Asn 


Glu 


Thr Glu 


Leu 


He Lys Glu 


His 


Asp 










410 








415 






420 


Leu 


Phe 


Phe 


Lys 


Ala 


Val 


Ser 


Leu Cys 


His 


Thr Val Gin 


He 


Ser 








425 








430 






435 


Asn 


Val 


Gin 


Thr 


Asp 


Cys 


Thr Gly Asp Gly 


Pro Trp Gin 


Ser 


Asn 










440 








445 






450 


Leu 


Ala 


Pro 


Ser 


Gin 


Leu Glu Tyr Tyr 


Ala 


Ser Ser Pro 


Asp 


Glu 










455 








460 






465 
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Lys 


Ala 


Leu 


Val 


Glu 


Ala Ala Ala Arg He 


Gly 


lie Val 


Phe 


fie 










470 


475 








480 


Gly 


Asn 


Ser 


Glu 


Glu J 


JThr .Met,, Glu Val Lys 


Thr 


Leu Gly Lys 


Leu , 










485 


490 








495, 


Glu 


Arg 


Tyr 


Lys 


Leu 


Leu His He Leu Glu 


Phe Asp Ser 


Asp 


Arg 










5'00 


' . 505 








SlO- ^ 


Arg 


Arg 


Met 


Ser 


Val 


He Val Gin Ala 'Pro 


Ser Gly Glu Lys 


Leu 










515 


520 








525 


Leu 


Phe 


Ala Lys 


Gly Ala Glu Ser Ser He 


Leu 


Pro Lys 


Cys 


He . 










530 


535 






540 


Gly 


Gly 


Glu 


He 


Glu 


Lys Thr Arg He His 


Val 


Asp Glu 


Phe 


Ala 










545 


550 








555 


L^ii 


Lys 


Gly Leu 


Arg 


Thi;- Leu Cys" ile Ala 


Tyr Arg Lys 


Phe 


Thr 










560 


. 565 








?70 


Ser 


Lys 


Glu Ty^ 


Glu 


Glu He Asp Lys Arg 


He 


Phe Glu 


Ala 


Arg 










575 


580 








585 


rThr 


Ala 


Leu Gin Gin Arg Glu Glu Lys Leu Ala Ala Val Phe Gin 










590 


595 








600 


Phe 


lie 


Glu Lys Asp» Leu He »Leu Leu Gly Ala Thr. Ala Val Glu 










605 


610 








615 


Asp 


Arg 


Leu Gin.. Asp Lys Val Arg Glu Thr 


He 


Glu Ala 


Leu Arg 










620 


■625 








63 0 


Met 


Ala 


Gly He Lys 


Val Trp Val Leu Thr 


Gly. Asp Lys 


His 


Glu 










635 


640 








645 


Thr 


Ala 


Val 


Ser 


Val 


Ser Leu Ser Cys Gly His 


Phe His 


Arg 


Thr 










650 


655 








660 


Met 


Asn 


He 


Leu 


Glu 


Leu H^ Asn Gin Lys 


Ser 


Asp Ser 


Glu 


Cys 










665 


670 




1 


1 - ■ ■ 


675 


Ala 


Glu 


Gin Leu Arg Gin Leu Ala Arg Arg 


lie 


Thr Glu 


Asp 


His 










680 


685 








690 


Val 


He 


Gin 


His 


Gly 


Leu Val Val A^p Gly 


Thr 


Ser Leu 


'Ser 


Leu 










695 


700 








705 


Ala 


Leu 


Arg Glu 


His 


Glu Lys Leu Phe Met 


Glu 


Val Cys 


Arg 


Asn 










710 


715 








720 


cys 


Ser 


Ala 


Val 


Leu 


cys Cys Arg Met Ala 


Pro 


Leu Gin 


Lys 


Ala 










725 


730 






735 


Lys 


Val 


He Arg 


Leu 


He Lys He Ser Pro 


Giu 


Lys, Pro 


lie 


Thr • 










740 


745 








750 


Leu 


Ala 


Val Gly Asp Gly Ala Asn Asp Val 


Ser Met He 


Gin 


Glu 










755 


760 








765 


Ala 


His 


Val 


Gly 


He 


Gly He Met Gly Lys 


Glu Gly Arg Gin Ala 










770 


775 








780 


Ala 


Arg 


Asn 


Ser 


Asp 


Tyr Ala He Ala Arg 


Phe Lys Phe 


Leu 


Ser 










785 


790 








795 


Lys 


Leu 


Leu 


Phe 


Val 


His Gly His Phe Tyr Tyr lie Arg He Ala 










800 


805 








810 


Thr 


Leu 


Val 


Gin 


Tyr 


Phe Phe Tyr Lys Asn 


Val 


cys Phe 


He 


Thr 










815 


820 






825 


Pro 


Gin 


Phe 


Leu 


Tyr 


Gin Phe Tyr Cys Leu 


Phe 


Ser Gin 


Gin 


Thr 










83 0 


835 








840 


Leu 


Tyr 


Asp 


Ser 


Val 


Tyr Leu Thr Leu Tyt Asn 


lie Cys 


Phe 


Thr 










845 


850 






855 


Ser 


Leu 


Pro 


He 


Leu 


He Tyr Ser Leu Leu 


Glu 


Gin His 


Val 


Asp 










860 


865 








870 


Pro 


His 


Val 


Leu 


Gin 


Asn Lys Pro Thr Leu 


Tyr 


Arg Asp 


He 


Ser 










875 


880 






885 


Lys 


Asn 


Arg 


Leu 


Leu 


Ser lie Lys Thr Phe 


Leu 


Tyr Trp 


Thr 


lie 










890 


895 






900 


Leu 


Gly 


Phe 


Ser 


His 


Ala Phe He Phe Phe 


Phe 


Gly Ser 


Tyr 


Leu 










905 


910 








915 


Leu 


He 


Gly Lys 


Asp 


Thr Ser Leu Leu Gly Asn 


Gly Gin 


Met 


Phe 










920 


925 








930 


tjxy 


TV 

Asn 


Trp 


Thr 


Phe Gly Thr Leu Val Phe 


Thr 


Val Met 


Val 


He 










935 


. 940 








945 


Thr 


Val 


Thr 


Val 


Lys 


Met Ala Leu Glu Thr 


His 


Phe Trp 


Thr 


Trp 










950 


955 






960. 


He 


Asn 


His 


Leu 


Val 


Thr Trp Gly Ser lie 


lie 


Phe Tyr 


Phe 


Val 
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965 970 975 

Phe Ser Leu Phe Tyr Gly Gly Ile Leu Trp Pro Phe Leu Gly Ser 
980 985 990 

Gln Asn Met Tyr Phe Val Phe Ile Gln Leu Leu Ser Ser Gly Ser 
995 1000 1005 

Ala Trp Phe Ala Ile Ile Leu Met Val Val Thr Cys Leu Phe Leu 
1010 1015 1020 

Asp Ile Ile Lys Lys Val Phe Asp Arg His Leu His Pro Thr Ser 
1025 1030 1035 

Thr Glu Lys Ala Gln Leu Thr Glu Thr Asn Ala Gly Ile Lys Cys 
1040 1045 1050 

Leu Asp Ser Met Cys Cys Phe Pro Glu Gly Glu Ala Ala Cys Ala 
1055 1060 1065 

Ser Val Gly Arg Met Leu Gln Arg Val Ile Gly Arg Cys Ser Pro 
1070 1075 1080 

Thr His Ile Ser Arg Cys Glu Ile Ser Leu Ser Ser Leu Cys Cys 
1085 1090 1095 

Arg 

<210> 21 

<211> 707 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7477720CD1 

<400> 21 



Met Ala Leu Gln Met Phe Val Thr Tyr Ser Pro Trp Asn Cys Leu 
1 5 10 15 

Leu Leu Leu Val Ala Leu Glu Cys Ser CjtJ-U Ala Ser Ser Asp Leu 
20 25 30 

Asn Glu Ser Ala Asn Ser Thr Ala Gln Tyr Ala Ser Asn Ala Trp 
35 40 45 

Phe Ala Ala Ala Ser Ser Glu Pro Glu Glu Gly Ile Ser Val Phe 
50 55 60 

Glu Leu Asp Tyr Asp Tyr Val Gln Ile Pro Tyr Glu Val Thr Leu 
65 70 75 

Trp Ile Leu Leu Ala Ser Leu Ala Lys Ile Gly Phe His Leu Tyr 
80 85 90 

His Arg Leu Pro Gly Leu Met Pro Glu Ser Cys Leu Leu Ile Leu 
95 100 105 

Val Gly Ala Leu Val Gly Gly Ile Ile Phe Gly Thr Asp His Lys 
110 115 120 

Ser Pro Pro Val Met Asp Ser Ser Ile Tyr Phe Leu Tyr Leu Leu 
125 130 135 

Pro Pro Ile Val Leu Glu Gly Gly Tyr Phe Met Pro Thr Arg Pro 
140 145 150 

Phe Phe Glu Asn Ile Gly Ser Ile Leu Trp Trp Ala Val Leu Gly 
155 160 165 

Ala Leu Ile Asn Ala Leu Gly Ile Gly Leu Ser Leu Tyr Leu Ile 
170 175 180 

Cys Gln Val Lys Ala Phe Gly Leu Gly Asp Val Asn Leu Leu Gln 
185 190 195 

Asn Leu Leu Phe Gly Ser Leu Ile Ser Ala Val Asp Pro Val Ala 
200 205 210 

Val Leu Ala Val Phe Glu Glu Ala Arg Val Asn Glu Gln Leu Tyr 
215 220 225 

Met Met Ile Phe Gly Glu Ala Leu Leu Asn Asp Gly Ile Thr Val 
230 235 240 

Val Leu Tyr Asn Met Leu Ile Ala Phe Thr Lys Met His Lys Phe 
245 250 255 

Glu Asp Ile Glu Thr Val Asp Ile Leu Ala Gly Cys Ala Arg Phe 
260 265 270 

Ile Val Val Gly Leu Gly Gly Val Leu Phe Gly Ile Val Phe Gly 
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275 








280 








285 


Phe 


He 


Ser 


Ala 


Phe 


He 


Thr Arg Phe 


Thr 


Gin Asn 


He 


Ser 


Ala 


• il' 








290 








295 






300 


He 


Glu 


Pro 


Leu . 


He 


Val 


Phe 


Met Phe 


Ser 


Tyr Leu • 


Ser 


Tyr 


Leu 










305 








310 




i 




315 


Ala Ala Glu 


Thr 


Leu 


Tyr 


Leu 


Ser Gly 


He 


Leu Ala 


He 


Thr Ala 










320 






325 




Val 




330 


Cys 


Ala 


Val 


Thr 


Met 


Lys 


Lys 


Tyr Val 


Glu 


Glu Asn 


Ser 


Gin 








335 








340 






345 


Thx 


Ser 


Tyr 


Thr 


Thr 


He 


Lys 


Tyr Phe 


Met 


Lys Met 


Leu 


Ser 


Ser 








3 50 






355 








360 


Val 


Ser 


Glu 


Thr 


Leu 


He 


Phe 


He Phe 


Met 


Gly Val 


Ser 


Thr 


Val 










3 65 








370 








375 


Gly Lys 


Asn 


His 


Glu 


Trp 


Asn 


Trp Ala 


Phe 


He Cys 


Pile 


Thr 


Leu 










380 








385 








390 


Ala 


Phe 


Cys 


Gin 


He 


Trp Arg Ala He 


Ser 


Val Phe 


Ala 


lieu 


Phe 








395 








400 








405 


Tyr 
1 


He 


Ser 


Ash 


Gin 


Phe Arg 


Thr Phe 


Pro 


Phe' S6r 


He 


Lys 


Asp 








410 








415 








420 


Gin 


Cys 


He 


He Phe Tyr Ser 'Gly Val Arg Gly Ala Gly Ser 


Phe 








425 








43 0 








435 


jser 


Leu 


Ala 


Phe 


Leu 


Leu 


Pro 


Leu Ser 


Leu 


Phe Pro 


Arg Lys 


Lvs 










440 








445 








450 


• Met 


Phe 


Val 


Thr 


Ala 


Thr 


Leu 


Val Val 


He 


Tyr Phe 


Thr 


Val 


Phe 










455 








460 








465 


He Gin Gly 


He 


Thr 


Val 


Gly 


Pro Leu 


Val 


Arg Tyf Leu Asp 


Val 










470 








475. 








480 


Lys 


Lys 


Thr 


Asn 


Lys 


Lys 


Glu 


Ser lie 


Asn 


Glu Glu 


Leii 


His 


He 






485 








490 








495 


Arg 


Leu 


Met 


Asp 


His 


Leu 


Lys 


Ala Gly 


He 


Glu Asp 


Val 


Cys 


Gly 










500 








505 








510 


His 


Tirp 


Ser 


His 


Tyr 


Gin 


Val 


Arg Asp 


Lys 


Phe Lys 




Phe 


Asp 








515 








520 








525 


His 


Arg 


Tyr 


Leu 


Arg 


Lys 


He 


Leu He 


Arg Lys Asn 


Leu 


Pro 


Lys 










530 








535 








540 


Ser 


Ser 


He 


Val 


Ser 


Leu 


Tyr 


Lys Lys 


Leu 


Glu Met 


Lys 


Gin 


Ala 










545 








550 








555 


He 


Glu 


Met 


Val 


Glu 


Thr 


Gly 


He Leu 


Ser 


Ser Thr 


Ala 


Phe 


Ser 










560 






565 








570 


He 


Pro 


His 


Gin 


Ala 


Gin 


Arg 


lie Gin 


Gly 


lie Lys 


Arg 


Leu 


Ser 










575 








580 








585 


Pro 


Glu 


Asp 


Val 


Glu 


Ser 


He 


Arg Asp 


He 


Leu Thr 


Ser 


Asn 


Met 








590 






595 








600 


Tyr Gin Val Arg Gin Arg Ttxf Leu Ser Tyr Asn Lys Tyr Asn Leu 










605 








610 








615 


Lys 


Pro 


Gin 


Thr 


Ser 


Glu 


Lys 


Gin Ala 


Lys 


Glu He 


Leu 


He 


Arg 








620 








625 








63 0 


Arg 


Gin 


Asn 


Thr 


Leu 


Arg 


Glu 


Ser Met 


Arg Lys Gly His 


Ser 


Leu 








635 








640 








645 


Pro 


Trp Gly Lys 


Pro 


Ala Gly Thr Lys 


Asn 


lie Arg Tyr 


Leu 


Ser 










650 








655 








660 


Tyr 


Pro 


Tyr Gly Asn 


Pro 


Gin 


Ser Ala 


Gly Arg Asp 


Thr Arg Ala 








665 








670 








675 


Ala Gly Phe 


Ser Gly 


Lys 


Leu 


Pro Thr 


Trp 


Leu Leu 


Cys 


Cys 


Phe 










680 








685 








690 


Ser 


Val 


Glu 


Ser Gly Gly Lys 


Tyr Leu 


Gly Val Trp Ala Lys 


Arg 










695 








700 








705 



Gin His 
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<223> Incyte ID No: 7477852CD1 

<400> 22 

Met Gly Gly Phe Leu Pro Lys Ala Glu Gly Pro Gly Ser Gln Leu 
1 5 10 15 

Gln Lys Leu Leu Pro Ser Phe Leu Val Arg Glu Gln Asp Trp Asp 
20 25 30 

Gln His Leu Asp Lys Leu His Met Leu Gln Gln Lys Arg Ile Leu 
35 40 45 

Glu Ser Pro Leu Leu Arg Ala Ser Lys Glu Asn Asp Leu Ser Val 
50 55 60 

Leu Arg Gln Leu Leu Leu Asp Cys Thr Cys Asp Val Arg Gln Arg 
65 70 75 

Gly Ala Leu Gly Glu Thr Ala Leu His Ile Ala Ala Leu Tyr Asp 
80 85 90 

Asn Leu Glu Ala Ala Leu Val Leu Met Glu Ala Ala Pro Glu Leu 
95 100 105 

Val Phe Glu Pro Thr Thr Cys Glu Ala Phe Ala Gly Gln Thr Ala 
110 115 120 

Leu His Ile Ala Val Val Asn Gln Asn Val Asn Leu Val Arg Ala 
125 130 135 

Leu Leu Thr Arg Arg Ala Ser Val Ser Ala Arg Ala Thr Gly Thr 
140 145 150 

Ala Phe Arg Arg Ser Pro Arg Asn Leu Ile Tyr Phe Gly Glu His 
155 160 165 

Pro Leu Ser Phe Ala Ala Cys Val Asn Ser Glu Glu Ile Val Arg 
170 175 180 

Leu Leu Ile Glu His Gly Ala Asp Ile Arg Ala Gln Asp Ser Leu 
185 190 195 

Gly Asn Thr Val Leu His Ile Leu Ile Leu Gln Pro Asn Lys Thr 
200 205 210 

Phe Ala Cys Gln Met Tyr Asn Leu Leu Leu Ser Tyr Asp Gly His 
215 220 225 

Gly Asp His Leu Gln Pro Leu Asp Leu Val Pro Asn His Gln Gly 
230 235 240 

Leu Thr Pro Phe Lys Leu Ala Gly Val Glu Gly Asn Thr Val Met 
245 250 255 

Phe Gln His Leu Met Gln Lys Arg Arg His Ile Gln Trp Thr Tyr 
260 265 270 

Gly Pro Leu Thr Ser Ile Leu Tyr Asp Leu Thr Glu Ile Asp Ser 
275 280 285 

Trp Gly Glu Glu Leu Ser Phe Leu Glu Leu Val Val Ser Ser Asp 
290 295 300 

Lys Arg Glu Ala Arg Gln Ile Leu Glu Gln Thr Pro Val Lys Glu 
305 310 315 

Leu Val Ser Phe Lys Trp Asn Lys Tyr Gly Arg Pro Tyr Phe Cys 
320 325 330 

Ile Leu Ala Ala Leu Tyr Leu Leu Tyr Met Ile Cys Phe Thr Thr 
335 340 345 

Cys Cys Val Tyr Arg Pro Leu Lys Phe Arg Gly Gly Asn Arg Thr 
350 355 360 

His Ser Arg Asp Ile Thr Ile Leu Gln Gln Lys Leu Leu Gln Glu 
365 370 375 

Ala Tyr Glu Thr Arg Glu Asp Ile Ile Arg Leu Val Gly Glu Leu 
380 385 390 

Val Ser Ile Val Gly Ala Val Ile Ile Leu Leu Leu Glu Ile Pro 
395 400 405 

Asp Ile Phe Arg Val Gly Ala Ser Arg Tyr Phe Gly Lys Thr Ile 
410 415 420 

Leu Gly Gly Pro Phe His Val Ile Met Ile Thr Tyr Ala Ser Leu 
425 430 435 

Val Leu Val Thr Met Val Met Arg Leu Thr Asn Thr Asn Gly Glu 
440 445 450 

Val Val Pro Met Ser Phe Ala Leu Val Leu Gly Trp Cys Ser Val 
455 460 465 

Met Tyr Phe Thr Arg Gly Phe Gln Met Leu Gly Pro Phe Thr Ile 
470 475 480 
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Met Ile Gln Lys Met Ile Phe Gly Asp Leu Met Arg Phe Cys Trp 
485 490 495 

Leu Met Ala Val Val Ile Leu Gly Phe Ala Ser Ala Phe Tyr Ile 
500 505 510 

Ile Phe Gln Thr Glu Asp Pro Thr Ser Leu Gly Gln Phe Tyr Asp 
515 520 525 

Tyr Pro Met Ala Leu Phe Thr Thr Phe Glu Leu Phe Leu Thr Val 
530 535 540 

Ile Asp Ala Pro Ala Asn Tyr Asp Val Asp Leu Pro Phe Met Phe 
545 550 555 

Ser Ile Val Asn Phe Ala Phe Ala Ile Ile Ala Thr Leu Leu Met 
560 565 570 

Leu Asn Leu Phe Ile Ala Met Met Gly Asp Thr His Trp Arg Val 
575 580 585 

Ala Gln Glu Arg Asp Glu Leu Trp Arg Ala Gln Val Val Ala Thr 
590 595 600 

Thr Val Met Leu Glu Arg Lys Leu Pro Arg Cys Leu Trp Pro Arg 
605 610 615 

Ser Gly Ile Cys Gly Cys Glu Phe Gly Leu Gly Asp Arg Trp Phe 
620 625 630 

Leu Arg Val Glu Asn His Asn Asp Gln Asn Pro Leu Arg Val Leu 
635 640 645 

Arg Tyr Val Glu Val Phe Lys Asn Ser Asp Lys Glu Asp Asp Gln 
650 655 660 

Glu His Pro Ser Glu Lys Gln Pro Ser Gly Ala Glu Ser Gly Thr 
665 670 675 

Leu Ala Arg Ala Ser Leu Ala Leu Pro Thr Ser Ser Leu Ser Arg 
680 685 690 

Thr Ala Ser Gln Ser Ser Ser His Arg Gly Trp Glu Ile Leu Arg 
695 700 705 

Gln Asn Thr Leu Gly His Leu Asn Leu Gly Leu Asn Leu Ser Glu 
710 715 720 

Gly Asp Gly Glu Glu Val Tyr His Phe 
725 

<210> 23 

<211> 492 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1471717CD1 

<400> 23 



Met Ala Thr Lys Pro Thr Glu Pro Val Thr Ile Leu Ser Leu Arg 
1 5 10 15 

Lys Leu Ser Leu Gly Thr Ala Glu Pro Gln Val Lys Glu Pro Lys 
20 25 30 

Thr Phe Thr Val Glu Asp Ala Val Glu Thr Ile Gly Phe Gly Arg 
35 40 45 

Phe His Ile Ala Leu Phe Leu Ile Met Gly Ser Thr Gly Val Val 
50 55 60 

Glu Ala Met Glu Ile Met Leu Ile Ala Val Val Ser Pro Val Ile 
65 70 75 

Arg Cys Glu Trp Gln Leu Glu Asn Trp Gln Val Ala Leu Val Thr 
80 85 90 

Thr Met Val Phe Phe Gly Tyr Met Val Phe Ser Ile Leu Phe Gly 
95 100 105 

Leu Leu Ala Asp Arg Tyr Gly Arg Trp Lys Ile Leu Leu Ile Ser 
110 115 120 

Phe Leu Trp Gly Ala Tyr Phe Ser Leu Leu Thr Ser Phe Ala Pro 
125 130 135 

Ser Tyr Ile Trp Phe Val Phe Leu Arg Thr Met Val Gly Cys Gly 
140 145 150 

Val Ser Gly His Ser Gln Gly Leu Ile Ile Lys Thr Glu Phe Leu 
155 160 165 
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Pro Thr Lys Tyr Arg Gly Tyr Met Leu Pro Leu Ser Gln Val Phe 
170 175 180 

Trp Leu Ala Gly Ser Leu Leu Ile Ile Gly Leu Ala Ser Val Ile 
185 190 195 

Ile Pro Thr Ile Gly Trp Arg Trp Leu Ile Arg Val Ala Ser Ile 
200 205 210 

Pro Gly Ile Ile Leu Ile Val Ala Phe Lys Phe Ile Pro Glu Ser 
215 220 225 

Ala Arg Phe Asn Val Ser Thr Gly Asn Thr Arg Ala Ala Leu Ala 
230 235 240 

Thr Leu Glu Arg Val Ala Lys Met Asn Arg Ser Val Met Pro Glu 
245 250 255 

Gly Lys Leu Val Glu Prp Val Leu Glu Lys Arg Gly Arg Phe Ala 
260 265 270 

Asp Leu Leu Asp Ala Lys Tyr Leu Arg Thr Thr Leu Gln Ile Trp 
275 280 285 

Val Ile Trp Leu Gly Ile Ser Phe Ala Tyr Tyr Gly Val Ile Leu 
290 295 300 

Ala Ser Ala Glu Leu Leu Glu Arg Asp Leu Val Cys Gly Ser Lys 
305 310 315 

Ser Asp 


Ser 


Ala 


Val 


Val Val Thr Gly Gly Asp Ser Gly Glu 


Ser 










320 


Cys His Met Phe 


325 








330 


Gin 


Ser 


Pro 


Cys 


Tyr 


Ala 


Pro Ser Asp 




Arg 








335 




340 








345 


Thr Met Ile Ile Ser Thr Ile Gly Glu Ile Ala Leu Asn Pro Leu 
350 355 360 

Asn Ile Leu Gly Ile Asn I^hd Leu Gly Arg Arg Leu Ser Leu Ser 
365 370 375 

Ile Thr Met Gly Cys Thr Ala Leu Phe Cys Leu Leu Leu Asn Ile 
380 385 390 

Cys Thr Ser Ser Ala Gly Leu Ile Gly Phe Leu Phe Met Leu Arg 
395 400 405 

Ala Leu Val Ala Ala Asn Phe Asn Thr Val Tyr Ile Tyr Thr Ala 
410 415 420 

Glu Val Tyr Pro Thr Thr Met Arg Ala Leu Gly Met Gly Thr Ser 
425 430 435 

Gly Ser Leu Cys Arg Ile Gly Ala Met Val Ala Pro Phe Ile Ser 
440 445 450 

Gln Val Leu Met Ser Ala Ser Ile Leu Gly Ala Leu Cys Leu Phe 
455 460 465 

Ser Ser Val Cys Val Val Cys Ala Ile Ser Ala Phe Thr Leu Pro 
470 475 480 

Ile Glu Thr Lys Gly Arg Ala Leu Gln Gln Ile Lys 
485 490 
<400> 24



Met Asn 


Met 


Lys 


Gin 


Lys 


Ser 


Val Tyr. 


Gin Gin Thr Lys Ala 


Leu 


1 








5 








10 


15 


Leu 


Cys 


Lys 


Asn 


Phe 


Leu 


Lys 


Lys Trp 


Arg Met Lys Arg Glu 


Ser 










20 








25 


30 


Leu 


Leu 


Glu 


Trp 


Gly 


Leu 


Ser 


He Leu Leu Gly Leu Cys He Ala 










35 








40 


45 


Leu 


Phe 


Ser 


Ser 


Ser 


Met 


Arg Asn Val 


Gin Phe Pro Gly Met 


Ala 










50 








55 


60 


Pro 


Gin 


Asn 


Leu 


Gly Arg 


Val 


Asp Lys 


Phe Asn Ser Ser Ser 


Leu 










65 






70 


75 


Met 


Val 


val 


Tyr 


Thr 


Pro 


He 


Ser Asn 


Leu Thr Gin Gin He 


Met 








80 








85 


90 
Asn fiys


Thr 


Ala 


Leu 


Ala 


Pro 










95 






Gly Ala 


Pro 


Asn 


Lvs 


Thr 


His 










110 










Tyr 


Ala 


Met 


Gly 


He 










125 




isys 


LeU 


He 


Phe 


Phe 


Gin Gly 










140 






Asp 


Phe 


Ser 


Ala 


His 


Cys 


Trp 










155 








Leu 


Thr 


Lvs 


TVT 


Trp Asn 










170 






AT a 


lie 


Asn 


Thr 


Ala. 


■ He 


He 










185 






Stair 


Val Leu 


Leu 


Lvs 


Lys 


Ala 










200 








Leu 


Thr 


Leu 


Phe 


Trp Gly 










215 








Gin 


Leu 


Pro 


Ser 


Ser 


Leu 


1 

I 








230 








Phe 


Ala 


Phe. 


Thr 


Thr 


Gly 










245 




Tyr 


Asn 


Leu 


Asn 


Glv 


Val 


lie 










260 






Tyr 


Thr 


Met 


lie 


Ala 


Thr 


Phe 










275 








Tyr 


Leu 


Leu 


Leu 


Ala 


Leu 










290 






T ^ r 

vaxy 


Asp 


Glu 


Arg 


His 


Tyr 


Ser. 










305 








Cys 


Phe 


Gin 


His 


Gin Arg 










320 








He 


Asp 


Ala 


Glu 


His 


Pro 










335 






Ala 


Pro 


Glu 


Phe 


Gin 


Gly Lys 










350 






Lys 


Lys 


Glu 


T\/7" 

xy J. 


Lys 


Gly Lys 










3 65 






Gly 


Leu 


Leu 


Phe 


Asp 


He 


Tyr 










380 




Gly 


His 


Ser 


Glv 


Ala 


Gly 


Lys 










395 






Gly Leu 


Ser 


Val 


Pro 


Thr 


Glu 










410 






Asn 


Leu 


Ser 


Glu 


Met 


Gin Asp 










425 






Gly Val 


Cys 


Pro. 


Gin 


Phe 


Asn 










440 






Lys 


Glu 


Asn 


Leu 


Ser 


Leu 


Phe 








455 






Lys 


Glu 


Val 


Glu 


Gin 


Glu 


Val 










470 






Met 


Gin 


Asn 


He 


Gin 


Asp 


Asn 










485 






Gin 


Lys 


Arg 


Lvs 


Leu 


Thr 


Phe 


Gin 








500 






He 


Leu 


Leu 


Leu 


Asp 


Glu 










515 




Ser 


Arg 


Asp 


Gin 


Val 


Trp 


Ser 










530 






His 


Val 


He 


Leu 


Phe 


Ser 


Thr 










545 






Leu 


Ala 


Asp 


Arg 


Lys 


Val 


ile 










560 






Ala Gly 


Ser 


Ser 


He 


Phe 


Leu 










575 






His 


Leu 


Ser 


Leu 


His 


Arg Asn 



Leu 


Leu 


Lys 


Gly Thr 


Ser 


vax 


xxe 






100 










X Uo 


Met 


Asp 


Glu 


He 


Leu 


Leu 


VsXU 


Asn 






115 












lie, Phe 


Asn Glu Thr 


jrne 


Ser 


Tyr 






130 






t 


•X JO 


Ty^r Asn 


Ser 


Pro 


Leu 


xrp 


Lys 


t^rXU 






145 






XbU 


Asp 


Gly 


Tyr 


Gly Olu 




Ser 


Cys 






160 










XOD 


Arg Gly 


Phe 


Val 


Ala 




urXXl 


Thr 






.175 










XOU 


Glu 


Val 


Ala Leu Val 


jrllfs 


; 

Leu - 


neu 






190 






1 






Val 


• Leu 


Thr 


Asn 


Leu 


VAX 


VOL J. 


Jrne 






205 










^ X u 


Cys 


Leu 


Gly 


Phe 


Thr 


VclX 


IrXie 


Tyr 






220 












Glu 


Trp 


He 


Leu 


Asn 


X JL6 


cys 


Ser 






235 










Met 


He 


Gin 


lie 


lie 


Lys 


Leu 


Asp 






250 












Phe 


Pro 


Asp 


Pro 


Ser 


CaXy 


Asp 


Ser 






265 










270. 


Ser 


Met 


Leu 


Leu 


Leu 


Asp 


Gly 


Leu 






280 








D tr 


Tyr 


Phe 


Asp Lys 


He 


Leu 


Pro 


Tyr 






295 








•5 A A 


Pro 


Leu 


Phe 


Phe 


Leu 


Asn 


Ser 


Ser 






310 










315 


Thr 


Asn 


Ala 


Lys 


Val 


iLle 


Glu 


Lys 






325 










*D o r» 
O J U 


Ser .Asp 


Asp 


Tyr Phe 


■ 

Glu 


Pro 


Val 






340 










345 


Glu 


Ala 


lie 


Arg 


lie 


Arg 


Asn 


Val 






355 










360 


Ser 


Gly 


Lys 


Val 


Glu 


Ala 


Leu 


Lys 






370 










o T tr 
37b 


Glu Gly 


Gin 


He 


Thr 


TV 1 — 

Ala 


He 


Leu 






385 










"3 o n 

o y u 


Ser 


Ser 


Leu 


Leu 


Aisn 


xxe 


T rti 1 

Leu 


Asn 






400 












Gly Ser 


Val 


Thr 


Ile 


Tyr 


TV «-«<>— 

Asn 


Lys 






415 










/I 'J A 


Leu 


Glu 


Glu 


Ile 


Arg 


Lys 


-xxe 


Thr 




Gin 


43.0 










43d 


Val 


Phe 


Asp 


lie 


Leu 


Tiir 


Val 






445 










>l C A 


Ala 


Lys 


Ile 


Lys 


Gly 


xxe 


HIS. 


Leu 






460 












Gin 


Arg 


He 


Leu 


Leu 


vvXU. 


Leu 


Asp 






475 










>1 Q A 
4 o U 


Leu 


Ala 


Lys 


His 


Leu 




^XU, 


Ol tT- 






490 












Gly 


lie 


Thr 


lie 


Leu 


VaXy 


Asp 


Pro 






505 










c:i A 

DXU 


Pro 


Thr 


Thr 


Gly Leu 




Pro 


irne 






520 












Leu 


Leu 


Arg 


Glu Arg 




rxXci 


Asp 






535 












Gin 


Ser 


Met 


Asp 


Glu 




Asp 


Tl 

xxe 






550 








555 


Met 


Ser 


Asn 


Gly Arg 


Leu 


Lys 


Cys 






565 










570 


Lys 


Arg 


Arg 


Trp Gly 


Leu 


Gly 


Tyr 






580 










585 


Glu 


He 


Cys 


Asn 


Pro 


Glu 


Gin 


He 
590










595 






600 




Ser 


Phe 


He 


Thr 


His 


His 


He 


Pro 


Asp Ala Lys Leu 


Lys Thf* 










605 










610 






615 


Glu 


Asn 


Lys 


Glu 


Lys 


Leu 


Val 


Tyr 


Thr 


Leu 


Pro 


Leu . Glu 


Arg Thr 








620 










625 






630 


Asn 


Thr 


Phe 


Pro 


Asp 


Leii 


Phe 


Ser Asp 


Leu Asp Lys Cys 


Ser Asp 








.635 










640 




Ttr lieu 


645 


Gin 


Gly Val 


Thr 


Gly 


Tyr 


Asp 


He 


Ser 


Met 


Ser 


Asn Glu 








650 










655 






660 


Val 


Phe 


Met 


Lys 


Leu 


Glu 


Gly 


Gin 


Ser 


Thr 


He Glu Gin 


Asp Phe 










665 










670 






675 


Glu 


Gin 


Val 


Glu 


Met 


He 


Arg 


Asp 


Ser 


Glu 


Ser 


Lqu Asn 


Glu Met 










680 










685 






690 


Glu 


Leu 


Ala 


His 


Ser 


Ser 


Phe 


Ser 


Glu 


Met 


Gin 


Thr Ala 


Val Ser 










695 










700 






705 


Asp 


Met 


Gly 


Leu 


TlD 


Arg 


Met 


Gin 


Val 


Phe Ala Met Ala Arg Leu 






710 










715 






720 


Arg 


Phe 


Leu 


Lys 


Leu 


Lys 


Arg 


Gin 


Thr 


Lys 


Val- 


Leu Leu 


Thr Leu 






725 










730 






735 


Leu 


Leu 


Val 


Phe 


Gly 


He 


Ala' 


lie 


Phe 


Pro 


Leu He Val 


Glu Asn 










.740 










745 






750 


He 


He 


Tyr 


Ala 


Met 


Leu 


Asn 


Glu 


Lys 


lie Asp Trp Glu Phe Lys 








755 










760 






765 


Asn Glu Leu Tyr Phe Leu Ser Pro Gly Gin Leu Pro Gin Glu Pro 










.770 










775 






780 


Arg. 


Thr 


Ser 


Leu 


Leu 


He 


He 


Asn 


Asn 


Thr 


Glu 


Ser Asn 


• He Glu 








785 










790; 






795 


Asp 


Phe 


He 


Lys 


Ser 


Leu 


Lys 


His 


Gin 


Asn 


He 


Leu Leu 


Glu Val 






800 










805 






810 


Asp 


Asp 


Phe 


Glu 


Asn Arg Asn Gly 


Thr 


Asp 


Gly Leu Ser Tyr Asn 






815 










820 








Gly Ala 


lie 


He 


Val 


Ser Gly Lys 


Gin 


Lys 


Asp Tyr Arg 


irne oej_ 










830 










835 






840 


Val 


Val 


Cys 


Asn 


Thr 


Lys 


Arg 


Leu 


His 


Cys 


Phe 


Pro He 


Leu Met 








845 










850 






855 


Asn 


He 


JL j.e 


Ser Asn Gly Leu Leu Gin Met 


Phe 


Asn His 


Thr Gin 










860 










865 






870 


His 


He 


Arg 


He 


Glu 


Ser 


Ser 


Pro 


Phe 


Pro 


Leu 


Ser His 


He Gly 








875 










880 






885 


Leu 


Trp 


Thr 


Gly 


Leu 


Pro 


Asp 


Gly 


Ser 


Phe 


Phe 


Leu Phe 


Leu Val 






890 










895 






900 


Leu 


Cys 


Ser 


He 


Ser 


Pro 


Tyr 


He 


Thr 


Met 


Gly 


Ser He 


Ser Asp 








905 










910 






915 


Tyir 


Lys 


Lys 


Asn 


Ala 


Lys 


Ser 


Gin 


Leu 


Trp 


He 


Ser Gly 


Leu Tyr 








920 










925 






930 


Thr 


Ser 


Ala 


Tyr 


Trp 


Cys Gly Gin Ala 


Leu- 


Val 


Asp Val 


Ser Phe 








935 










940 






945 


Phe 


He 


Leu 


He 


Leu 


Leu 


Leu 


Met 


Tyr 


Leu 


He 


Phe Tyr 


He Glu 










950 










955 






960 


Asn 


Met 


Gin 


Tyr 


Leu 


Leu 


He 


Thr 


Ser 


Gin 


He 


Val Phe 


Ala Leu 








965 










970 






975 


Val 


He 


Val 


Thr 


Pro 


Gly Tyr Ala Ala 


Ser 


Leu 


Val Phe 


Phe He. 










980 










985 






990 


Tyr 


Met 


He 


Ser 


Phe 


He 


Phe 


Arg 


Lys 


Arg 


Arg 


Lys Asn 


Ser Gly 








995 








1000 






1005 


Leu 


Trp 


Ser 


Phe 


Tyr 


Phe 


Phe 


Phe 


Ala 


Ser 


Thr 


He Met 


Phe Ser 








1010 










1015 






1020 


He 


Thr 


Leu 


He 


Ash 


His 


Phe 


Asp 


Leu 


Ser 


He 


Leu He 


Thr Thr 










1025 










1030 






1035 


Met 


Val 


Leu 


Val 


Pro 


Ser 


Tyr 


. Thr 


Leu 


Leu 


Gly Phe Lys 


Thr Phe 










1040 








1045 






1050 


Leu 


Glu 


Val 


Arg 


Asp 


Gin 


Glu 


His 


Tyr 


Arg 


Glu 


Phe Pro 


Glu Ala 










1055 










1060 






1065 


Asn 


Phe 


Glu 


Leu 


Ser 


Ala 


Thr 


Asp 


Phe 


Leu 


Val 


Cys Phe 


He Pro 










1070 








1075 






1080 


Tyr 


Phe 


Gin 


Thr 


Leu 


Leu 


Phe 


Val 


Phe 


Val 


Leu Arg Cys 


Met Glu 








1085 










1090 






1095 
Leu Lys Cys Gly Lys Lys Arg Met Arg Lys Asp Pro Val Phe Arg

1100 1105 1110 

Ile Ser Pro Gln Ser Arg Asp Ala Lys Pro Asn Pro Glu Glu Pro

1115 1120 1125

Ile Asp Glu Asp Glu Asp Ile Gln Thr Glu Arg Ile Arg Thr Val

1130 1135 1140

Thr Ala Leu Thr Thr Ser Ile Leu Asp Glu Lys Pro Val Ile Ile

1145 1150 1155

Ala Ser Cys Leu His Lys Glu Tyr Ala Gly Gln Lys Lys Ser Cys

1160 1165 1170

Phe Ser Lys Arg Lys Lys Lys Ile Ala Ala Arg Asn Ile Ser Phe

1175 1180 1185

Cys Val Gln Glu Gly Glu Ile Leu Gly Leu Leu Gly Pro Ser Gly

1190 1195 1200

Ala Gly Lys Ser Ser Ser Ile Arg Met Ile Ser Gly Ile Thr Lys

1205 1210 1215

Pro Thr Ala Gly Glu Val Glu Leu Lys Gly Cys Ser Ser Val Leu

1220 1225 1230

Gly His Leu Gly Tyr Cys Pro Gln Glu Asn Val Leu Trp Pro Met

1235 1240 1245

Leu Thr Leu Arg Glu His Leu Glu Val Tyr Ala Ala Val Lys Gly

1250 1255 1260

Leu Arg Glu Ala Asp Ala Arg Leu Ala Ile Ala Arg Leu Val Ser

1265 1270 1275

Ala Phe Lys Leu His Glu Gln Leu Asn Val Pro Val Gln Lys Leu

1280 1285 1290

Thr Ala Gly Ile Thr Arg Lys Leu Cys Phe Val Leu Ser Leu Leu

1295 1300 1305

Gly Asn Ser Pro Val Leu Leu Leu Asp Glu Pro Ser Thr Gly Ile

1310 1315 1320

Asp Pro Thr Gly Gln Gln Gln Met Trp Gln Ala Ile Gln Ala Val

1325 1330 1335

Val Lys Asn Thr Glu Arg Gly Val Leu Leu Thr Thr His Asn Leu 

1340 1345 1350 

Ala Glu Ala Glu Ala Leu Cys Asp Arg Val Ala Ile Met Val Ser

1355 1360 1365

Gly Arg Leu Arg Cys Ile Gly Ser Ile Gln His Leu Lys Asn Lys

1370 1375 1380

Leu Gly Lys Asp Tyr Ile Leu Glu Leu Lys Val Lys Glu Thr Ser

1385 1390 1395

Gln Val Thr Leu Val His Thr Glu Ile Leu Lys Leu Phe Pro Gln

1400 1405 1410

Ala Ala Gly Gln Gln Arg Tyr Ser Ser Leu Leu Thr Tyr Lys Leu

1415 1420 1425

Pro Val Ala Asp Val Tyr Pro Leu Ser Glu Thr Phe His Lys Leu

1430 1435 1440

Glu Ala Val Lys His Asn Phe Asn Leu Glu Glu Tyr Ser Leu Ser 

1445 1450 1455 

Gln Cys Thr Leu Glu Lys Val Phe Leu Glu Leu Ser Lys Glu Gln

1460 1465 1470

Glu Val Gly Asn Phe Asp Glu Glu Ile Asp Thr Thr Met Arg Trp

1475 1480 1485 

Lys Leu Leu Pro His Ser Asp Glu Pro 

1490 

<210> 25
<211> 774
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 4599654CD1 

<400> 25 

Met Glu Ala Glu Gln Arg Pro Ala Ala Gly Ala Ser Glu Gly Ala

1 5 10 15

Thr Pro Gly Leu Glu Ala Val Pro Pro Val Ala Pro Pro Pro Ala

20 25 30

Thr Ala Ala Ser Gly Pro Ile Pro Lys Ser Gly Pro Glu Pro Lys

35 40 45

Arg Arg His Leu Gly Thr Leu Leu Gln Pro Thr Val Asn Lys Phe

50 55 60

Ser Leu Arg Val Phe Gly Ser His Lys Ala Val Glu Ile Glu Gln

65 70 75

Glu Arg Val Lys Ser Ala Gly Ala Trp Ile Ile His Pro Tyr Ser

80 85 90

Asp Phe Arg Phe Tyr Trp Asp Leu Ile Met Leu Leu Leu Met Val

95 100 105

Gly Asn Leu Ile Val Leu Pro Val Gly Ile Thr Phe Phe Lys Glu

110 115 120

Glu Asn Ser Pro Pro Trp Ile Val Phe Asn Val Leu Ser Asp Thr

125 130 135

Phe Phe Leu Leu Asp Leu Val Leu Asn Phe Arg Thr Gly Ile Val

140 145 150

Val Glu Glu Gly Ala Glu Ile Leu Leu Ala Pro Arg Ala Ile Arg

155 160 165

Thr Arg Tyr Leu Arg Thr Trp Phe Leu Val Asp Leu Ile Ser Ser

170 175 180

Ile Pro Val Asp Tyr Ile Phe Leu Val Val Glu Leu Glu Pro Arg

185 190 195

Leu Asp Ala Glu Val Tyr Lys Thr Ala Arg Ala Leu Arg Ile Val

200 205 210

Arg Phe Thr Lys Ile Leu Ser Leu Leu Arg Leu Leu Arg Leu Ser

215 220 225

Arg Leu Ile Arg Tyr Ile His Gln Trp Glu Glu Ile Phe His Met

230 235 240

Thr Tyr Asp Leu Ala Ser Ala Val Val Arg Ile Phe Asn Leu Ile

245 250 255 

Gly Met Met Leu Leu Leu Cys His Trp Asp Gly Cys Leu Gln Phe

260 265 270

Leu Val Pro Met Leu Gln Asp Phe Pro Pro Asp Cys Trp Val Ser

275 280 285

Ile Asn His Met Val Asn His Ser Trp Gly Arg Gln Tyr Ser His

290 295 300

Ala Leu Phe Lys Ala Met Ser His Met Leu Cys Ile Gly Tyr Gly

305 310 315

Gln Gln Ala Pro Val Gly Met Pro Asp Val Trp Leu Thr Met Leu

320 325 330

Ser Met Ile Val Gly Ala Thr Cys Tyr Ala Met Phe Ile Gly His

335 340 345

Ala Thr Ala Leu Ile Gln Ser Leu Asp Ser Ser Arg Arg Gln Tyr

350 355 360

Gln Glu Lys Tyr Lys Gln Val Glu Gln Tyr Met Ser Phe His Lys

365 370 375

Leu Pro Ala Asp Thr Arg Gln Arg Ile His Glu Tyr Tyr Glu His

380 385 390

Arg Tyr Gln Gly Lys Met Phe Asp Glu Glu Ser Ile Leu Gly Glu

395 400 405

Leu Ser Glu Pro Leu Arg Glu Glu Ile Ile Asn Phe Thr Cys Arg

410 415 420 

Gly Leu Val Ala His Met Pro Leu Phe Ala His Ala Asp Pro Ser 

425 430 435 

Phe Val Thr Ala Val Leu Thr Lys Leu Arg Phe Glu Val Phe Gln

440 445 450

Pro Gly Asp Leu Val Val Arg Glu Gly Ser Val Gly Arg Lys Met

455 460 465

Tyr Phe Ile Gln His Gly Leu Leu Ser Val Leu Ala Arg Gly Ala

470 475 480

Arg Asp Thr Arg Leu Thr Asp Gly Ser Tyr Phe Gly Glu Ile Cys

485 490 495 

Leu Leu Thr Arg Gly Arg Arg Thr Ala Ser Val Arg Ala Asp Thr 

500 505 510 

Tyr Cys Arg Leu Tyr Ser Leu Ser Val Asp His Phe Asn Ala Val 
515 520 525

Leu Glu Glu Phe Pro Met Met Arg Arg Ala Phe Glu Thr Val Ala

530 535 540

Met Asp Arg Leu Leu Arg Ile Gly Lys Lys Asn Ser Ile Leu Gln

545 550 555

Arg Lys Arg Ser Glu Pro Ser Pro Gly Ser Ser Gly Gly Ile Met

560 565 570

Glu Gln His Leu Val Gln His Asp Arg Asp Met Ala Arg Gly Val

575 580 585

Arg Gly Arg Ala Pro Ser Thr Gly Ala Gln Leu Ser Gly Lys Pro

590 595 600

Val Leu Trp Glu Pro Leu Val His Ala Pro Leu Gln Ala Ala Ala

605 610 615

Val Thr Ser Asn Val Ala Ile Ala Leu Thr His Gln Arg Gly Pro

620 625 630

Leu Pro Leu Ser Pro Asp Ser Pro Ala Thr Leu Leu Ala Arg Ser 

635 640 645

Ala Trp Arg Ser Ala Gly Ser Pro Ala Ser Pro Leu Val Pro Val

650 655 660

Arg Ala Gly Pro Trp Ala Ser Thr Ser Arg Leu Pro Ala Pro Pro

665 670 675

Ala Arg Thr Leu His Ala Ser Leu Ser Arg Ala Gly Arg Ser Gln

680 685 690

Val Ser Leu Leu Gly Pro Pro Pro Gly Gly Gly Gly Arg Arg Leu

695 700 705

Gly Pro Arg Gly Arg Pro Leu Ser Ala Ser Gln Pro Ser Leu Pro

710 715 720

Gln Arg Ala Thr Gly Asp Gly Ser Pro Gly Arg Lys Gly Ser Gly

725 730 735

Ser Glu Arg Leu Pro Pro Ser Gly Leu Leu Ala Lys Pro Pro Arg

740 745 750

Thr Ala Gln Pro Pro Arg Pro Pro Val Pro Glu Pro Ala Thr Pro

755 760 765

Arg Gly Leu Gln Leu Ser Ala Asn Met

770

<210> 26 
<211> 614 
<212> PRT 

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 5047435CD1 

<400> 26 

Met Ala Glu Gly Glu Arg Gly Ala Asp Val Pro His Gly Leu Gly 
1 5 10 15

Ala Trp Leu Ala Asp Val Ala Leu Ala Ala Leu Arg Ala Gly Gly

20 25 30

Gln Gly Arg Arg Asp Arg Gly Gly Gly Gly Pro Glu Ser Leu Ser

35 40 45

Gly Gly Ser Gly Val Gly Asp Ser Gly Gly Gly Cys Ala Pro Gly

50 55 60

Pro Ser Ala Pro Pro Ala Arg Arg Arg Val Pro Leu Ala Met Gly 

65 70 75 

His Ser Pro Pro Val Leu Pro Leu Cys Ala Ser Val Ser Leu Leu 

80 85 90 

Gly Gly Leu Thr Phe Gly Tyr Glu Leu Ala Val Ile Ser Gly Ala

95 100 105

Leu Leu Pro Leu Gln Leu Asp Phe Gly Leu Ser Cys Leu Glu Gln

110 115 120

Glu Phe Leu Val Gly Ser Leu Leu Leu Gly Ala Leu Leu Ala Ser

125 130 135

Leu Val Gly Gly Phe Leu Ile Asp Cys Tyr Gly Arg Lys Gln Ala

140 145 150

Ile Leu Gly Ser Asn Leu Val Leu Leu Ala Gly Ser Leu Thr Leu
Ala




155 




160 








165 


Gly 


Leu 


Gly 


Ser Leu 


Ala Trp Leu 


Val 


Leu Gly Arg Ala 


Val 








170 ! ► 




175 








180 • 


Val 


Gly 


Phe 


Ala 


He Ser 
135 . 


Leu Ser Ser 
1 


Met 
190 


Ala Cys 


Cys 


He 


Tyr- 
195. r 


Val 


Ser 


Giu 


"Leu 


Val Gly Pro Arg Gin Arg 


Gly Val 


Leu 


Vai 


Ser 










200 




i65 






210 


Leu 


Tyr 


Glii Ala Gly He Thr Val Gly 


He 


Leu Leu' 


Ser 


Tyr 


Ala 










215 




220' 








225 


Leu 


Asn 


Tyr Ala Leu Ala Gly Thr Pro 


Trp 


Gly Trp Arg His 


Met 










230 




235 








240 


Phe 


Gly 


Trp 


Ala 


Thr Ala 
245 


Pro Ala; Val 


Leu 
250 


Gin Ser* 


Leu 


Ser 


Leu 
255 


Leu 


Phe 


Leu Pro Ala Gly Tho: Asp Glu Thr 


Ala 'Thr 


His 


Lys 


Asp ■ 










260 




265 








270 


Leu 


lie 


Pro 


Leu iGln Gly Gly Glu Ala 


Pro 


Lys Leu Gly Pro 


Gly 










275 




280 








285 


Arg 


Pro 


Arg Tyr 


Ser Phe 


Leu ^sp Leu 


Phe 


Arg Ala T^g Asp Asn 










290' 


295 








300 


Met 


Arg 


Gly Arg 


Thr Thr 


Val Gly Leu 


Glv 


Leu Val 


Leu 


Phe 


Gin 










305 




310 








315 


Gin 


Leu 


Thr 


Gly Gin Pro 


Asn Val Leu 


Cys 


Tyr Ala Ser Thr 


He 










320 




325 








33 0 


Phe 


Ser 


Ser 


Val 


Gly Phe 
335 


His Gly Gly 


340 


Ser Ala 


Val 


L4u 


Ala 
345 


Ser 


Val 


Gly Leu .Gly Ala Val Lys Val 
350 » V 




Ala Thr 


Leu 


Thr 


Ala 






355 








360 


Met 


Gly 


Leu 


Val 


Asp Arg Ala Gly Arg 




Ala Leu 


Leu' 


'Leu 


Ala 










365 




370 








375 


Gly 


Cys 


Ala 


Leu Met Ala Leu Ser Val 




Gly He Gly .Leu Val 










380 




385 








390 


Ser 


Phe 


Ala 


Val 


Pro Met Asp Ser Gly 


XT J. W 


Ser Cys 


Leu 


Ala 


Val 










395 




400 








405 


Pro 


Asn 


Ala 


Thr Gly Gin 


Thr Gly Leu 


Pro 


Gly Asp 


Ser 


Gly 


Leu 










410 




415 








420 


Leu 


Gin 


Asp 


Ser 


Ser Leu 
425 


Pro Pro lie 


Pro 
43 0 


Arg Thr 


Asn 


Glu 


Asp , 
435 


Gin 


Arg 


Glu 


Pro 


lie- Leu 
440 


Ser Thr Ala 


Lys 
445 


Lys Thr 


Lys 


Pro 


His 
450 


Pro 


Arg 


Ser 


Gly Asp Pro 


ser Ala Pro 


Pro 


Arg Leu 


Ala 


Leu 


Ser 










455 




460 








465 


Ser 


Ala 


Leu 


Pro 


Gly Pro 


Pro Leu Pro 


Ala 


Arg Gly 


His 


Ala 


Leu 










47 0 




475 


Val Phe 






480 


Leu 


Arg 


Trp 


Thr 


Ala Leu 
485 


Leu Cys Leu 


Met 
490 


Val 


Ser 


Ala 
495 


Phe 


Ser 


Phe 


Gly 


Phe Gly 
500. 


Pro Val Thr 


Trp 
505 


Leu Val 


Leu 


Ser 


Glu 
510 


He 


Tyr 


Pro 


Val 


Glu He 
515 


Arg Gly Arg 


Ala 
520 


Phe Ala 


Phe 


Cys 


Asn 
525 


Ser 


Phe 


Asn 


Trp 


Ala Ala 
530 


Asn Leu Phe 


He 
535 


Ser Leu 


Ser 


Phe 


Leu 
540 . 


Asp 


Leu 


He 


Gly 


Thr He 
545 


Gly Leu Ser 


Trp 
550 


Thr Phe 


Leu 


Leu 


Tyr 
555 


Gly 


Leu 


Thr 


Ala 


Val Leu 


Gly Leu Gly Phe 


He Tyr 


Leu 


Phe 


Val 










560 




565 








570 


Pro 


Glu 


Thr 


Lys 


Gly Gin 


Ser Leu Ala 


Glu 


He Asp Gin Gin 


Phe 










575 




580 








585 


Gin 


Lys 


Arg 


Arg 


Phe Thr 


Leu Ser Phe 


Gly 


His Arg Gin Asn 


Ser 


Thr 








590 




595 








600 


Gly 


He 


Pro 


Tyr Ser 
605 


Arg He Glu 


He 
610 


Ser Ala 


Ala 


Ser 





<210> 27 

<211> 2180 

<212> PRT 

<213> Homo sapiens 
<221> misc_feature

<223> Incyte ID No: 7475603CD1 

<400> 27

Met Arg Phe Arg Lys Gly Gln Glu Leu Pro Aid Ala Ala Pro His

1 5 10 15

Val Phe Ser Pro Thr Val Val Leu Thr Ser Leu Ser Arg Pre Leu

20 25 30

Pro Ser Leu Thr Met Ala Phe Trp Thr Gln Leu Met Leu Leu Leu

35 40 45

Trp Lys Asn Phe Met Tyr Arg Arg Arg Gln Pro Val Gln Leu Leu

50 55 60

Val Glu Leu Leu Trp Pro Leu Phe Leu Phe Phe Ile Leu Val Ala

65 70 75

Val Arg His Ser His Pro Pro Leu Glu His His Glu Cys His Phe

80 85 90

Pro Asn Lys Pro Leu Pro Ser Ala Gly Thr Val Pro Trp Leu Gln

95 100 105

Gly Leu Ile Cys Asn Val Asn Asn Thr Cys Phe Pro Gln Leu Thr

110 115 120

Pro Gly Glu Glu Pro Gly Arg Leu Ser Asn Phe Asn Asp Ser Leu 

125 130 135 

Val Ser Arg Leu Leu Ala Asp Ala Arg Thr Val Leu Gly Gly Ala 

140 145 150 

Ser Ala His Arg Thr Leu Ala Gly Leu Gly Lys Leu Ile Ala Thr

155 160 165

Leu Arg Ala Ala Arg Ser Thr Ala Gln Pro Gln Pro Thr Lys Gln

170 175 180

Ser Pro Leu Glu Pro Pro Met Leu Asp Val Ala Glu Leu Leu Thr

185 190 195

Ser Leu Leu Arg Thr Glu Ser Leu Gly Leu Ala Leu Gly Gln Ala

200 205 210

Gln Glu Pro Leu His Ser Leu Leu Glu Ala Ala Glu Asp Leu Ala

215 220 225

Gln Glu Leu Leu Ala Leu Arg Ser Leu Val Glu Leu Arg Ala Leu

230 235 240

Leu Gln Arg Pro Arg Gly Thr Ser Gly Pro Leu Glu Leu Leu Ser

245 250 255

Glu Ala Leu Cys Ser Val Arg Gly Pro Ser Ser Thr Val Gly Pro 

260 265 270

Ser Leu Asn Trp Tyr Glu Ala Ser Asp Leu Met Glu Leu Val Gly 

275 280 285 

Gln Glu Pro Glu Ser Ala Leu Pro Asp Ser Ser Leu Ser Pro Ala

290 295 300

Cys Ser Glu Leu Ile Gly Ala Leu Asp Ser His Pro Leu Ser Arg

305 310 315

Leu Leu Trp Arg Arg Leu Lys Pro Leu Ile Leu Gly Lys Leu Leu

320 325 330

Phe Ala Pro Asp Thr Pro Phe Thr Arg Lys Leu Met Ala Gln Val

335 340 345

Asn Arg Thr Phe Glu Glu Leu Thr Leu Leu Arg Asp Val Arg Glu

350 355 360

Val Trp Glu Met Leu Gly Pro Arg Ile Phe Thr Phe Met Asn Asp

365 370 375

Ser Ser Asn Val Ala Met Leu Gln Arg Leu Leu Gln Met Gln Asp

380 385 390

Glu Gly Arg Arg Gln Pro Arg Pro Gly Gly Arg Asp His Met Glu

395 400 405

Ala Leu Arg Ser Phe Leu Asp Pro Gly Ser Gly Gly Tyr Ser Trp

410 415 420

Gln Asp Ala His Ala Asp Val Gly His Leu Val Gly Thr Leu Gly

425 430 435

Arg Val Thr Glu Cys Leu Ser Leu Asp Lys Leu Glu Ala Ala Pro

440 445 450

Ser Glu Ala Ala Leu Val Ser Arg Ala Leu Gln Leu Leu Ala Glu

455 460 465 
His


Arg 


Phe 


Trp 


Ala 


Gly 


Val 


Val Phe 


Leu 


Gly Pro Glu Asp 


Ser 










470 








475 




480 


Ser 


Asp 


Pro 


Thr 


Glu 


His 


Pro 


Thr Pro 


Asp 


Leu Gly Pro Gly 


His 










485 








490 




495 


Val 


Arg 


He. 


Lys 


He 


Arg 


Met 


Asp » He 


Asp 


Val Val Tl^r Arg 


Thr 










500 








505 




510 


Asn 


Lys 


He 


Arg 


Asp 


Arg 


Phe 


Trp Asp 


Pro 


Gly Pro Ala Ala. 


Asp 










515 






520 


525 


Pro 


Leu 


Thr 


Asp 


Leu 


Arg 


Tyr Val Trp Gly Gly Ph^ Val Tyr 


Leu 










530 








535 




540 


Gin 


Asp 


Leu 


Val 


Glu 


Arg 


Ala 


Aia Val 


Arg Val Leu Ser Gly 


Ala 










545 








550 




555 


Asn 


Pro 


Arg 


Ala 


Gly Leu 


Tyr 


Leu Gin 


Gin 


Met Pro Tyr Pro 


Cys 










560 








565 




570 


Tyr 


Val Asp 


Asp 


Val 


Phe 


Leu Arg Val 


Leu Ser Arg Ser Leu 


Pro 










575 








580 




585 


Leu 


Phe 


Leu 


Thr 


Leu 


Ala 


Tapp 


lie Tyr 


Ser 


Val Thr Leu Thr 


Val 










590 






595 


600 




Ala 


Val 


Val 


Arg Glu Lys Glu Thr Arg Leu Arg Asp Thr 


Met 








605 








610 




615 


Arg 


Ala 


Met 


Gly 


.Leu 


Ser Arg Ala Val 


Leu 


Trp Leu. Gly Trp 


Phe 










.620 








625 




630 


Leu 


Ser 


Cvs 


Leu 


Glv 


Pro 


Phe 


Leu Leu 


Ser 


Ala Ala Leu Leu 


Val 


• 








635 








640 




(545 


Leu 


Val 


Leu 


Lys 


Leu 


Gly Asp 


He Leu 


Pro 


Tyr Ser His Pro 


Gly 










650 








655 




660 


Val 


Val 


Phe 


Leu 


Phe 


Leu 


Ala 


Ala Phe 


Ala, Val Ala Thr Val 


Thr 










665 








670 




675 


Gin 


Ser 


Phe 


Leu 


Leu 


Ser 


Ala 


Phe Phe 


Ser 


Arg Ala Asn Leu 


Ala 










680 








685 




690 


Ala 


Ala 


Cys 


Gly 


Glv 


Leu 


Ala 


Tyr Phe 


Ser 


Leu Tyr Leu Pro 


Tyr 










695 








700 




705 


Val 


Leu 


Cys 


Val 


Ala 


Trp 


Arg 


Asp Arg 


Leu 


Pro Ala Gly Gly Arg 










710 








715 




720 


Val 


Ala 


Ala 


Ser 


Leu 


Leu 


Ser 


Pro Val 


Ala 


Phe Gly Phe Gly 


Cys 










725 








730 


735 


GlU 


Ser 


Leu 


Ala 


Leu 


Leu Glu Glu Gin Gly Glu Gly Ala Gin Trp 










740 








745 




750 


His 


Asn 


Val 


Gly 


Thr 


Arg 


Pro 


Thr Ala 


Asp 


Val Phe Ser Leu 


Ala 










755 








760 




765 


Gin 


Val 


Ser 


Gly 


Leu 


Leu 


Leu 


Leu Asp Ala Ala Leu Tyr Gly Leu 










770 








775 




780 


Ala 


Thr 


Trp 


Tyr 


Leu 


Glu 


Ala 


Val cys 


Pro 


Gly Gin Tyr Gly 


He 










785 








790 




795 


Pro 


Glu 


Pro 


Trp 


Asn 


Phe 


Pro 


Phe Arg Arg 


Ser Tyr Trp. cys 


Gly 










800 








805 




810 


Pro 


Arg 


Pro 


Pro 


Lys 


Ser 


Pro 


Ala Pro 


Cys 


Pro Thr Pro Leu 


Asp 










815 








820 




825 


Pro 


Lys 


Val 


Leu 


Val 


Glu 


Glu 


Ala Pro 


Pro 


Gly Leu Ser Pro 


Gly 










830 








835 




840 


Val 


Ser Val 


Arg 


Ser 


Leu 


Glu 


Lys Arg 


Phe 


Pro Gly Ser Pro 


Gin 










845 








850 


855 


Pro 


Ala 


Leu 


Arg 


Gly 


Leu 


Ser 


Leu Asp 


Phe 


Tyr Gin Gly His 


He 










860 








865 




870 


Thr 


Ala 


Phe 


Leu 


Gly 


His Asn Gly Ala 


Gly 


Lys Thr Thr Thr 


Leu 










875 








880 


885 


Ser 


He 


Leu 


Ser 


Gly 


Leu 


Phe 


Pro Pro 


Ser 


Gly Gly Ser Ala 


Phe 










890 








895 


900 


lie 


L'eu Gly 


His 


Asp 


Val 


Arg 


Ser Ser 


Met 


Ala Ala He Arg 


Pro 










905 








9io 




915 


His 


Leu Gly 


Val 


Cys 


Pro 


Gin 


Tyr Asn 


Val 


Leu Phe Asp Met 


Leu 










920 








925 


930 


Thr 


Val 


Asp 


Glu 


His 


Val Trp 


Phe Tyr 


Gly Arg Leu Lys Gly 


Leu 










935 








940 




945 


Ser 


Ala 


Ala 


Val 


Val 


Gly 


Pro 


Glu Gin 


Asp 


Arg Leu Leu Gin 


Asp 










950 








955 




960 


Val 


Gly Leu 


Val 


Ser 


Lys 


Gin 


Ser Val 


Gin 


Thr Arg His Leu 


Ser 
965 970 975

Gly Gly Met Gln Arg Lys Leu Ser Val Ala Ile Ala Phe Val Gly

980 985 990

Gly Ser Gln Val Val Ile Leu Asp Glu Pro Thr Ala Gly Val Asp

995 1000 1005

Pro Ala Ser Arg Arg Gly Ile Trp Glu Leu Leu Leu Lys Tyr Arg

1010 1015 1020

Glu Gly Arg Thr Leu Ile Leu Ser Thr His His Leu Asp Glu Ala

1025 1030 1035

Glu Leu Leu Gly Asp Arg Val Ala Val Val Ala Gly Gly Arg Leu 

1040 1045 1050 

Cys Cys Cys Gly Ser Pro Leu Phe Leu Arg Arg His Leu Gly Ser

1055 1060 1065

Gly Tyr Tyr Leu Thr Leu Val Lys Ala Arg Leu Pro Leu Thr Thr

1070 1075 1080

Asn Glu Lys Ala Asp Thr Asp Met Glu Gly Ser Val Asp Thr Arg

1085 1090 1095

Gln Glu Lys Lys Asn Gly Ser Gln Gly Ser Arg Val Gly Thr Pro

1100 1105 1110

Gln Leu Leu Ala Leu Val Gln His Trp Val Pro Gly Ala Arg Leu

1115 1120 1125

Val Glu Glu Leu Pro His Glu Leu Val Leu Val Leu Pro Tyr Thr 

1130 1135 1140

Gly Ala His Asp Gly Ser Phe Ala Thr Leu Phe Arg Glu Leu Asp

1145 1150 1155

Thr Arg Leu Ala Glu Leu Arg Leu Thr Gly Tyr Gly Ile Ser Asp

1160 1165 1170

Thr Ser Leu Glu Glu Ile Phe Leu Lys Val Val Glu Glu Cys Ala

1175 1180 1185

Ala Asp Thr Asp Met Glu Asp Gly Ser Cys Gly Gln His Leu Cys

1190 1195 1200

Thr Gly Ile Ala Gly Leu Asp Val Thr Leu Arg Leu Lys Met Pro

1205 1210 1215

Pro Gln Glu Thr Ala Leu Glu Asn Gly Glu Pro Ala Gly Ser Ala

1220 1225 1230

Pro Glu Thr Asp Gln Gly Ser Gly Pro Asp Ala Val Gly Arg Val

1235 1240 1245

Gln Gly Trp Ala Leu Thr Arg Gln Gln Leu Gln Ala Leu Leu Leu

1250 1255 1260 

Lys Arg Phe Leu Leu Ala Arg Arg Ser Arg Arg Gly Leu Phe Ala 

1265 1270 1275 

Gln Ile Val Leu Pro Ala Leu Phe Val Gly Leu Ala Leu Val Phe

1280 1285 1290

Ser Leu Ile Val Pro Pro Phe Gly His Tyr Pro Ala Leu Arg Leu

1295 1300 1305

Ser Pro Thr Met Tyr Gly Ala Gln Val Ser Phe Phe Ser Glu Asp

1310 1315 1320

Ala Pro Gly Asp Pro Gly Arg Ala Arg Leu Leu Glu Ala Leu Leu 

1325 1330 1335 

Gln Glu Ala Gly Leu Glu Glu Pro Pro Val Gln His Ser Ser His

1340 1345 1350

Arg Phe Ser Ala Pro Glu Val Pro Ala Glu Val Ala Lys Val Leu

1355 1360 1365

Ala Ser Gly Asn Trp Thr Pro Glu Ser Pro Ser Pro Ala Cys Gln

1370 1375 1380

Cys Ser Arg Pro Gly Ala Arg Arg Leu Leu Pro Asp Cys Pro Ala

1385 1390 1395

Ala Ala Gly Gly Pro Pro Pro Pro Gln Ala Val Thr Gly Ser Gly

1400 1405 1410

Glu Val Val Gln Asn Gln Thr Gly Arg Asn Leu Ser Asp Phe Leu

1415 1420 1425

Val Lys Thr Tyr Pro Arg Leu Val Arg Gln Gly Leu Lys Thr Lys

1430 1435 1440 

Lys Trp Val Asn Glu Val Arg Tyr Gly Gly Phe Ser Leu Gly Gly 

1445 . 1450 1455 

Arg Asp Pro Gly Leu Pro Ser Gly Gin Glu Leu Gly Arg Ser Val 

1460 1465 1470 
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Glu Glu Levi Trp Ala Leu Leu Ser Pro Leu Pro Gly Gly Ala Leu 

1475 1480 1485

Asp Arg Val Leu Lys Asn Leu Thr Ala Trp Ala His Ser Leu Asp
1490 1495 1500

Ala Gln Asp Ser Leu Lys Ile Trp Phe Asn Asn Lys Gly Trp His
1505 1510 1515

Ser Met Val Ala Phe Val Asn Arg Ala Ser Asn Ala Ile Leu Arg
1520 1525 1530

Ala His Leu Pro Pro Gly Pro Ala Arg His Ala His Ser Ile Thr
1535 1540 1545

Thr Leu Asn His Pro Leu Asn Leu Thr Lys Glu Gln Leu Ser Glu
1550 1555 1560

Ala Ala Leu Met Ala Ser Ser Val Asp Val Leu Val Ser Ile Cys
1565 1570 1575

Val Val Phe Ala Met Ser Phe Val Pro Ala Ser Phe Thr Leu Val
1580 1585 1590

Leu Ile Glu Glu Arg Val Thr Arg Ala Lys His Leu Gln Leu Met
1595 1600 1605

Gly Gly Leu Ser Pro Thr Leu Tyr Trp Leu Gly Asn Phe Leu Trp
1610 1615 1620

Asp Met Cys Asn Tyr Leu Val Pro Ala Cys Ile Val Val Leu Ile
1625 1630 1635

Phe Leu Ala Phe Gln Gln Arg Ala Tyr Val Ala Pro Ala Asn Leu
1640 1645 1650

Pro Ala Leu Leu Leu Leu Leu Leu Leu Tyr Gly Trp Ser Ile Thr
1655 1660 1665

Pro Leu Met Tyr Pro Ala ^ef the Phe Phe Ser Val Pro Ser Thr 

1670 1675 1680

Ala Tyr Val Val Leu Thr Cys Ile Asn Leu Phe Ile Gly Ile Asn
1685 1690 1695

Gly Ser Met Ala Thr Phe Val Leu Glu Leu Phe Ser Asp Gln Lys
1700 1705 1710

Leu Gln Glu Val Ser Arg Ile Leu Lys Gln Val Phe Leu Ile Phe
1715 1720 1725

Pro His Phe Cys Leu Gly Arg Gly Leu Ile Asp Met Val Arg Asn
1730 1735 1740

Gln Ala Met Ala Asp Ala Phe Glu Arg Leu Gly Asp Arg Gln Phe
1745 1750 1755

Gln Ser Pro Leu Arg Trp Glu Val Val Gly Lys Asn Leu Leu Ala
1760 1765 1770

Met Val Ile Gln Gly Pro Leu Phe Leu Leu Phe Thr Leu Leu Leu
1775 1780 1785

Gln His Arg Ser Gln Leu Leu Pro Gln Pro Arg Val Arg Ser Leu
1790 1795 1800

Pro Leu Leu Gly Glu Glu Asp Glu Asp Val Ala Arg Glu Arg Glu
1805 1810 1815

Arg Val Val Gln Gly Ala Thr Gln Gly Asp Val Leu Val Leu Arg
1820 1825 1830

Asn Leu Thr Lys Val Tyr Arg Gly Gln Arg Met Pro Ala Val Asp
1835 1840 1845

Arg Leu Cys Leu Gly Ile Pro Pro Gly Glu Cys Phe Gly Leu Leu
1850 1855 1860

Gly Val Asn Gly Ala Gly Lys Thr Ser Thr Phe Arg Met Val Thr 

1865 1870 1875

Gly Asp Thr Leu Ala Ser Arg Gly Glu Ala Val Leu Ala Gly His 

1880 1885 1890 

Ser Val Ala Arg Glu Pro Ser Ala Ala His Leu Ser Met Gly Tyr 

1895 1900 1905 

Cys Pro Gln Ser Asp Ala Ile Phe Glu Leu Leu Thr Gly Arg Glu
1910 1915 1920

His Leu Glu Leu Leu Ala Arg Leu Arg Gly Val Pro Glu Ala Gln
1925 1930 1935

Val Ala Gln Thr Ala Gly Ser Gly Leu Ala Arg Leu Gly Leu Ser
1940 1945 1950

Trp Tyr Ala Asp Arg Pro Ala Gly Thr Tyr Ser Gly Gly Asn Lys
1955 1960 1965

Arg Lys Leu Ala Thr Ala Leu Ala Leu Val Gly Asp Pro Ala Val 
1970 1975 1980

Val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Ser Ala Arg
1985 1990 1995

Arg Phe Leu Trp Asn Ser Leu Leu Ala Val Val Arg Glu Gly Arg
2000 2005 2010

Ser Val Met Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu
2015 2020 2025

Cys Ser Arg Leu Ala Ile Met Val Asn Gly Arg Phe Arg Cys Leu
2030 2035 2040

Gly Ser Pro Gln His Leu Lys Gly Arg Phe Ala Ala Gly His Thr
2045 2050 2055

Leu Thr Leu Arg Val Pro Ala Ala Arg Ser Gln Pro Ala Ala Ala
2060 2065 2070

Phe Val Ala Ala Glu Phe Pro Gly Ala Glu Leu Arg Glu Ala His
2075 2080 2085

Gly Gly Arg Leu Arg Phe Gln Leu Pro Pro Gly Gly Arg Cys Ala
2090 2095 2100

Leu Ala Arg Val Phe Gly Glu Leu Ala Val His Gly Ala Glu His
2105 2110 2115

Gly Val Glu Asp Phe Ser Val Ser Gln Thr Met Leu Glu Glu Val
2120 2125 2130

Phe Leu Tyr Phe Ser Lys Asp Gln Gly Lys Asp Glu Asp Thr Glu
2135 2140 2145

Glu Gln Lys Glu Ala Gly Val Gly Val Asp Pro Ala Pro Gly Leu
2150 2155 2160

Gln His Pro Lys Arg Val Ser Gln Phe Leu Asp Asp Pro Ser Thr
2165 2170 2175

Ala Glu Thr Val Leu
2180

<210> 28 

<211> 1737

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7477845CD1

<400> 28 

Met Leu Lys Arg Lys Gln Ser Ser Arg Val Glu Ala Gln Pro Val
1 5 10 15

Thr Asp Phe Gly Pro Asp Glu Ser Leu Ser Asp Asn Ala Asp Ile
20 25 30

Leu Trp Ile Asn Lys Pro Trp Val His Ser Leu Leu Arg Ile Cys
35 40 45

Ala Ile Ile Ser Val Ile Ser Val Cys Met Asn Thr Pro Met Thr
50 55 60

Phe Glu His Tyr Pro Pro Leu Gln Tyr Val Thr Phe Thr Leu Asp
65 70 75

Thr Leu Leu Met Phe Leu Tyr Thr Ala Glu Met Ile Ala Lys Met
80 85 90

His Ile Arg Gly Ile Val Lys Gly Asp Ser Ser Tyr Val Lys Asp
95 100 105

Arg Trp Cys Val Phe Asp Gly Phe Met Val Phe Cys Leu Trp Val
110 115 120

Ser Leu Val Leu Gln Val Phe Glu Ile Ala Asp Ile Val Asp Gln
125 130 135

Met Ser Pro Trp Gly Met Leu Arg Ile Pro Arg Pro Leu Ile Met
140 145 150

Ile Arg Ala Phe Arg Ile Tyr Phe Arg Phe Glu Leu Pro Arg Thr
155 160 165

Arg Ile Thr Asn Ile Leu Lys Arg Ser Gly Glu Gln Ile Trp Ser
170 175 180

Val Ser Ile Phe Leu Leu Phe Phe Leu Leu Leu Tyr Gly Ile Leu
185 190 195

Gly Val Gln Met Phe Gly Thr Phe Thr Tyr His Cys Val Val Asn
Glu
Phe


Lys 




Mot- 
Leu Gly


Tyr 


Ser 


Gly Phe 










^ ou 










±nr 


va± 


Tyr 


^ JLU. 


Axa 


Ala 


Ser 










0*7 c: 
z / 3 










lyr 


Arg 


TV 1 -» 

Axa 


T1 0 
J. j.e 


Asp 


Ser 


Phe 










0 Q n 


y 






lyr 




IXe 


Thr 


Leu 


xxe 


Phe 


Phe 










^ UD 












He 


Ala 


va,X 


xxe 


He 


Glu 


















m n 


Pino 


Gin 


Gin 


Mot" 


xrp 


Gly Ser 










•J *3 C 










rm- 

Tnir 


Thr 


Gin 


1VAa4> 


Jrne . 


His. 


.Glu 


















Xi6U 




Ala 


Val 


Asp 


vax 


Ash 


Lys 










job 








Cys 


ijeu 


Oln 


Lys 


Met 


i>3ec 


Arg 


Ser 










0 0 r\ 








Zj'GU 


Seir 


Met 


Val 


Thr 


vax 


Asp 


Val 
















Tyr 


Lys 


Gly Glu 


Asn 


Phe 


Arg 


Arg 










4XU 










V7IU 


Val 


Ala 


Pile 


Thr 


Val 


Leu 










425 








uys 


xxe 


Trp 


Cys 


Leu 


Gly 


Phe 


Thr 










44 U 








nlS 


Lys 


Phe 


Glu 


Leu 


Lieu 


Leu 


Val 










455 








Tyr 


Pro 


Asp Leu 


Tyr 


His 


Ser 


Gin 










470 








Arg 


Vai 


Val 


Arg 


Leu 


lie 


Lys 


He 










485 








vai 


Tyr 


Lys 


He 


Pne 


Gly 


Pro 


Gly 










500 








vax 


Fxie 


Thr 


Ala 


Ser . 


Leu 


Leu 


He 










etc 
515 












Phe 


Cys 


pne 


vax 


Glu 


Glu 










O J U 








Pro 


Arg 


Ala 


Phe 


Met 


Ser. 


Met 


Phe 










545 








.Trp 


vax 


Asp 


Val 


Meu 


Asp 


Gin 


Thr 










c ^ n 
50 U 








Trp 


iiXa 


Pro 


Val 


vax 


Axa 


He 


Tyr 
AT =»


Tiir 


Leu 


lie 


Leu 


Leu 


Ser, 


Leu 


















aSH 


• 


Glu 


Leu 


Asp 


VaXU 


Asp 


Leu 










DUD 








Ser 


Glu 


Ala 


Asn 


/ixa 


Asp 


Thr 






















lie 


Phe 




Lys 


Phe 


Pro 










DO 5 






lie 


Ser 


Lys 


Leu 


Pro 


Ser 


Asp 


Phe 










650 








Ser 


Phe 


Met 


Lys 


Gin 


Phe 


lie 


Asp 










665 






Cys 


Leu 


Leu Arg 


Ser 


Leu 


Pro 


Thr 










680 








His 


Ser 


Lys 


Arg 


Ser 


Ala 


He 


Glu 



695 













0 1 A 

210 


Trp 


Asn 


Ser 


Leu 


Ala Xle 


Pro 


■ 


0 


■ 1 




■ ■ 


225 


Glu 


Glu 


Gly 


Tyr 


.Gin Cys 


Pro 






■ 1 






^ A n 

240 


v^XU 


Asp. 


Leu 


Gly 


• 

Leu Ser 


Arg 




Z D u 








0 c c 
Z55 


Asn 


v^XU 


xxe 


Gly 


Thr Ser 


He 




Z DD 








id / 0 


Gin 


(jXU 


Gxy 


Tarp 


Val Phe 


Leu 


• 


0 Q rt 








285 


Pro 


Arg 


Trp 


Arg 


Ser Tyr 


Pne 


• 


0 Q C 

z y 5 






3 00 . 


Leu. 


TV 

Axa 


Trp 


Leu 


- 

Val Lys 


Asn 










oXb 


Thr 


•pne 


AX a 


Glu 


.He Alrg 


Val 










330 


Arg 


Ser 


Ser» 


Tnr 


Thr ser 


. Thir 




340 








345 


Asp 


Ala 


Ala 


Gly 


Gly Trp 


Gin 




*5 C C 

3 55 






- 


360 


Pro 


Gin 


Gly 


Arg 


Ala Pro 


Ala 




"2 T A 
3/0 








375 


Ser 


Val 


Phe 


HXS 


Met Phe 


He 




•5 0 c 
385 








390 


lie 


Val 


Ala 


Ala- 


Ser -Asn 


Tyr 




Ann 

400 ; 






• 


405 


Glii 


Tyr 


Asp 


Glu 


Phe Ty^r 


Leu 




415 








420 


Phe 


Asp 


Leu 


Glu 


Ala Leu 


Leu 




43 0 








435 


Gly 


Tyr 


He 


Ser 


Ser Ser 


Leu 




445 








450 


He 


Gly 


Thr 


Thr 


Leu His 


Val 




460 








465 


Phe 


Thr 


Tyr 


Phe 


Gin Val 


Leu 




475 








480 


Ser 


Pro 


Ala 


Leu 


Glu Asp 


Phe 




49 0 






495 


Lys 


Lys 


Levi 


Gly 


Ser Leu 


Val 




505 








510 


Val 


Met 


Ser 


Ala 


He Ser 


Leu 




520 








525 


Leu 


Asp 


Arg 


Phe 


Thr Thr 


Phe 




535 








540 


Gin 


He. 


Leu 


Thr 


Gin Glu 


Gly 




C r— 0 

550 








555 


Leu 


Asn 


TV T 

Ala 


Val 


Gly His 


Met 




5o5 






fx 

57 0 


Pne 


xle 


Leu 


Tyr 


His Leu 


Phe 




DO U 








c 0 c 

585 


Pne 


VaX 


Ala 


Val 


He Leu 


Asp, 




c 0 c 
5^5 








600 


uys 


uys 


Leu 


Lys 


Gin Leu 


Lys 




C\ A 
0X0 






615 


Lys. 


Glu 


Lys 


Leu 


Pro Leu 


Arg 












d3 0 


Asn 


Arg 


Pro 


Gin 


Met Val 


Lys 












C A C 

645 


Thr 


vax 


Pro 


Lys 


IXe Arg 


Glu 




DOD 






CCA 


Arg 


Gxn 


Gxn 


Gxn 


Asp Thr 


Cys 




670 








675 


Thr 


Ser 


Ser 


Ser 


Ser Cys 


Asp 




685 








690 


Asp 


Asn 


Lys 


Tyr 


He Asp 


Gin 




700 








705 
Lys Leu Arg Lys Ser Val Phe Ser Ile Arg Ala Arg Asn Leu Leu
710 715 720

Glu Lys Glu Thr Ala Val Thr Lys Ile Leu Arg Ala Cys Thr Arg
725 730 735

Gln Arg Met Leu Ser Gly Ser Phe Glu Gly Gln Pro Ala Lys Glu
740 745 750

Arg Ser Ile Leu Ser Val Gln His His Ile Arg Gln Glu Arg Arg
755 760 765

Ser Leu Arg His Gly Ser Asn Ser Gln Arg Ile Ser Arg Gly Lys
770 775 780

Ser Leu Glu Thr Leu Thr Gln Asp His Cys Asn Thr Val Ile Tyr
785 790 795

Arg Asn Ala Gln Arg Glu Val Ser Glu Ile Lys Met Ile Gln Glu
800 805 810

Lys Lys Glu Leu Ala Glu Met Leu Gln Gly Lys Cys Lys Lys Glu
815 820 825

Leu Arg Glu Ser His Pro Tyr Phe Asp Lys Pro Leu Phe Ile Val
830 835 840

Gly Arg Glu His Arg Phe Arg Asn Phe Cys Arg Val Val Val Arg
845 850 855

Ala Arg Phe Asn Ala Ser Lys Thr Asp Pro Val Thr Gly Ala Val
860 865 870

Lys Asn Thr Lys Tyr His Leu Leu Tyr Asp Leu Leu Gly Leu Val
875 880 885

Thr Tyr Leu Asp Trp Val Met Ile Ile Val Thr Ser Asp Ser Cys
890 895 900

Ile Ser Met Met Phe Glu Ser Pro Phe Arg Arg Val Met His Ala
905 910 915

Pro Thr Leu Gln Ile Ala Glu Tyr Val Phe Val Ile Phe Met Ser
920 925 930

Ile Glu Leu Asn Leu Lys Ile Met Ala Asp Gly Leu Phe Phe Thr
935 940 945

Pro Thr Ala Val Ile Arg Asp Phe Gly Gly Val Met Asp Ile Phe
950 955 960

Ile Tyr Leu Val Ser Leu Ile Phe Leu Cys Trp Met Pro Gln Asn
965 970 975

Val Pro Ala Glu Ser Gly Ala Gln Leu Leu Met Val Leu Arg Cys
980 985 990

Leu Arg Pro Leu Arg Ile Phe Lys Leu Val Pro Gln Met Arg Lys
995 1000 1005

Val Val Arg Glu Leu Phe Ser Gly Phe Lys Glu Ile Phe Leu Val
1010 1015 1020

Ser Ile Leu Leu Leu Thr Leu Met Leu Val Phe Ala Ser Phe Gly
1025 1030 1035

Val Gln Leu Phe Ala Gly Lys Leu Ala Lys Cys Asn Asp Pro Asn
1040 1045 1050

Ile Ile Arg Arg Glu Asp Cys Asn Gly Ile Phe Arg Ile Asn Val
1055 1060 1065

Ser Val Ser Lys Asn Leu Asn Leu Lys Leu Arg Pro Gly Glu Lys
1070 1075 1080

Lys Pro Gly Phe Trp Val Pro Arg Val Trp Ala Asn Pro Arg Asn
1085 1090 1095

Phe Asn Phe Asp Asn Val Gly Asn Ala Met Leu Ala Leu Phe Glu
1100 1105 1110

Val Leu Ser Leu Lys Gly Trp Val Glu Val Arg Asp Val Ile Ile
1115 1120 1125

His Arg Val Gly Pro Ile His Gly Ile Tyr Ile His Val Phe Val
1130 1135 1140

Phe Leu Gly Cys Met Ile Gly Leu Thr Leu Phe Val Gly Val Val
1145 1150 1155

Ile Ala Asn Phe Asn Glu Asn Lys Gly Thr Ala Leu Leu Thr Val
1160 1165 1170

Asp Gln Arg Arg Trp Glu Asp Leu Lys Ser Arg Leu Lys Ile Ala
1175 1180 1185

Gln Pro Leu His Leu Pro Pro Arg Pro Asp Asn Asp Gly Phe Arg
1190 1195 1200

Ala Lys Met Tyr Asp Ile Thr Gln His Pro Phe Phe Lys Arg Thr
1205






XzXU 


1215 


j.xe 




Leu 


Leu Val 


Leu 


Axa 


vsxn ser vax Leu Leu 


Ser Val Lys 








1220 








■1 0*^ A 

X.^JU 


Trp 


Asp 


Val Glu Asp 


pro 


vax 


xnr vax pro Deu AXa 


•Tnr Met Ser 














1 0 ^ n 


, ' 1245 


V CLJL 


Val 
1 




XriX r'xlc 


X X fc; 


irXlti 


1 T.on OTii '\7ial TtW-v 

vdx ij«3u. ijxu vax xnr 


Met Liys xxe 






X/J 3 \J 








1 0 A 


JL 


AT ^ 




oex irxO 


aXCI 


v»Xy 


irne xrp v^xn ber Arg 


Arg Asn Arg 








' xz o«<> 






XZ / U 


Xz / b 






Lieu 


jjeix vax 


XllX 


oex 


jjeu. Kaxy vax vax xrp 


vax vax Xjeu 








1280 






X^iOD 








Ala 


Leu Leu 


Asn 


Ala 


xyr xnGT xyr Mec Me.t 


Gly Ala Cys 








1295 


V 




X J u u 


1 c 
X JUb 


VdJL 


X X 


Val 


Phe Arg 




irlie 


ber xxe . cys . Gxy Lys 


Hxs Vax Tnr 








1310 






1315 


1 Ton 






Met 


Leu Leu 


_ 

Leu 


Thr 


Val Val Val Ser Met 


!iyr Lys Ser 








1325 




M^t 


1330 








lie 


lie Val 


iyj.y 


Phe Leu Leu Leu- Leu 


Cys Tyr .Ala 








1340 






1345 








Gly Val Val 


Leu 


Phe.. Gly Thr Val Lys Tyr 


Gly Glu Asn 








.1355 






1360 


1365 




ASH 


Arg His Ala 


Asn 


Phe 


Ser Ser Ala Gly Lys 


Ala lie Thr 








1370 






1375 


1380 


1 Vial 


_ 


Phe Arg lie 


vax 


Thr 


Gly Giu Asp Trp Asn 


Lys lie Met 








1385 






1390 


1395 


IliS. 


Asp 


Cys 


Met Val 


Cjixn 


Pro 


Pro Phe Cys Thr Pro 


Asp Glu Phe 








1400 






1405 . 


1410 


XXIJl 


Tyr 


Trp Ala Thr 


Asp 


Cys 


Gly Asn Tyr Ala Gly 


Ala Leu Met 








1415 






. 1420 


1425 


Tyr 


JrXie 


Cys 


Ser Phe 




Val 


lie lie Ala Tyr lie 


Met Leu Asn 








1430 






1435 


• 1440 






Val 


Ala lie 


xxe 


Val 


Glu Asn Phe Ser Leu 


lie Tyr Ser 








1445 






1450 


1455 




V7J.U. 


Glu Asp Gin 


xjeu 


Leu 


Ser Tyr Asn Asp Leu 


Arg. His Phe 








1460 






1465 


1470 


V9J.X1 


TT i=k 


lie 


Trp Asn 


Met 


Val 


Asp Asp Lys Arg Glu 


Val Phe Pro 








1475 






1480 


1485 




irile 


Arg Val Lys 


Fne 


Leu 


Leu Arg Leu Leu Arg 


Gly Arg Leu 








1490 






1495 


1500 


\jX U. 


va.X 


Asp Leu Asp 


Lys 


Asp 


Lys Leu Leu Phe Lys 


His Met Cys 








1505 






1510 


1515 


lyr 


vsXu. 


Met 


Glu Arg 


jjeu 


His 


Asn Gly Gly Asp Val 


Thr Phe His 








1520 






1525 


1530 




Vclx 


Leu 


Ser Met 


Leu 


Ser 


Tyr Arg Ser Val Asp 


lie Arg Lys 








1535 






1540 


1545 






Gin 


Leu Glu 


<jXU- 


Leu 


Leu Ala Arg Glu Qln 


Leu Glu Tyr 








1550 






1555 


1560 


'xXix 


X Xc= 


Glu 


Glu Glu 


- vax 


Ala 


Lys Gin Thr lie Arg 


Met Trp Leu 








.1565 






1570 


1575 




Liys 


Cys 


Leu Lys 


Arg 


lie 


Arg Ala Lys Gin Gin 


Gin Ser Cys 








1580 






' 1585 


1590 


oc=x 


X X6 


lie 


His Ser 


Leu 


Arg. 


Glu Ser Gin Gin Gin 


Glu Leu Ser 








1595 






1600 


X DUD 


21 T"rT 
. AX ^ 




Leu 


Asn Pro 


irx 0 


Ser 


lie Glu Thr Thr Gin 


Pro Ser Glu 








1610 






1615 


X U 




xxix 


Asn 


Ala Asn 


Ser 


Gin 


Asp Asn. Ser Met Gin 


Pro Glu Thr 








1625 






1630 


1635 


«9 vsx 


oc?x 


Gin 


Gin Glri 


x^eu 


Leu 


Ser Pro Thr Leu Ser 


Asp Arg Gly 








1640 






1645 


I /r C A 


Gly 


•9 CSX 


Arg Gin Asp 


Axa 


Ala 


Asp Ala Gly Lys Pro 


Gin Arg Lys 








1655 






.1660 


■1 £r c 

1665 


Phe 


GlV 


Gin Trp Arg 


Leu 


Pro 


ser Ala Pro Lys Pro 


xxe oer hxs 








1670 






1675 


1680 


Ser 


Val 


Ser 


Ser Val 


Asn 


Leu 


Arg Phe Gly Gly Arg 


Thr Thr Met 








isas 






1690 


1695 


Lys 


Ser 


Val 


Val Cys 


Lys 


Met 


Asn Pro Met Thr Asp 


Ala Ala Ser 








1700 






1705 


1710 
Cys Gly Ser Glu Val Lys Lys Trp Trp Thr Arg Gln Leu Thr Val
1715 1720 1725 

Glu Ser Asp Glu Ser Gly Asp Asp Leu Leu Asp Ile
1730 1735

<210> 29

<211> 547

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 168,827001



<400> 29 



Met 


Ala 


Phe 


Gin 


Asp 


Leu 


Leu 


Asp 


Gin Val 


Gly 


Gly Leu Gly 


Arg 


1 








• 5 








10 




15 


Phe 


Gin 


Hei Leu 


Gin ^et 


Val 


Phe 


Leu .He. 


Met 


Phe Asn Val 


He 






His 




20, 








25 








Val 


Tyr 


Gin 


Thr 


Gin 


Leu 


Glu 


Asn Phe 


Ala 


Ala Phe Tie* 


XJCSU- 










35 








40 






Asp 


His Arg Cys 


Trp 


Val 


His 


He 


Leu Asp Asn 


Asp Thr He 


Pro 










50 








55 




OU 


Asp 


Asn Asp 


Pro 


Gly Thr 
65 


Leu 


Ser Gin Asp 
70 


Ala 


Leu Ljsu Aifg 


T -l A 

xj.e 

/ D 


Ser 


lie 


Pro 


Phe 


Asp 


Ser 


Asn 


Leu Arg Pro 


Glu 


Lys Cys Arg 


Arg 






His 




■ 80 








85 




Q n 
y u 


Phe 


Val 


Pro 


Gin 


Trp 


Lys* 


Leu 


He His 


Leu 


Asn Gly,a?ir 


Phe 










95 








100 






Pro 


Asn 


Thr 


s^r 


Glu .Pro 


Asp 


Thr- 


Glu Pro 


Cys 


Val Asp Gly 


Trp 










110 








, 115 




Ton 


Val 


Tyr Asp Gin 


Ser 


Ser 


Phe 


Pro 


Ser Thr 


He 


Val Thr Lys 


Trp 










125 








130 






Aspi 


Leu Val 


Cys 


Glu 


Ser 


Gin 


Pro 


Leu Asn 


Ser 


Val Ala Lys 












140 








; 145 






Leu 


Phe 


Met 


Ala 


Gly Met 


Met 


Val 


Gly Gly 


Asn 


Leu Tyr Gly 


His 










155 








1,60 




X 03 


Leu 


Ser Asp Arg 


Phe Gly 


Arg 


Lys 


Phe Val 


Leu 


Arcf Ti^D Set* 


j.yx 










170 








175 




180 


Leu 


Gin 


Leu 


Ala 


He 


Val 


Gly Thr 


Cys Ala 


Ala 


Phe Ala Pro 


Thr 


He 








185 








190 






195 


Leu 


Val 


Tyr 


Cys 


Ser 


Leu 


Arg 


Phe Leu 


Ala 


Gly Ala Ala 


Thr 


Phe 








200 








.205 




210 


Ser 


He 


He 


Val 


Asn 


Thr 


Val 


Leu Leu 


He 


Val Glu Tirp 


He 




His 






215 








220 




225 


Thr 


Gin 


Phe 


Cys 


Ala 


Met 


Ala' Leu Thr 


Leu 


Thr Leu Cys 


Ala 


Ala 








230 








235 




240 


Ser 


He 


Gly 


His 
245 


He 


Thr: 


Leu 


Gly Ser 
250 


Leu 


Ala Phe Val 


He 
255 


Arg 


Asp 


Gin 


Cys 


He 


Leu 


Gin 


Leu 


Val Met 


Ser 


Ala Pro Cys 


Phe 










260 








, 265 




270 


Val 


Phe 


Phe 


Leu 


Phe 


Ser 


Arg Trp 


Leu Ala 


Glu 


Ser Aia Arg 


Trp 










275 








280 




285 


Leu 


He 


He 


Asn 


Asn 


Lys 


Pro Glu Glu Gly 


Leu 


Lys Glu Leu 


Thr 








His 


290 








295 




300 


Lys 


Ala 


Ala 


Arg Asn 


Gly Met 


Lys Asn 


Ala 


Glu Asp He 


Leu 


Thr 








305 








310 




315 


Met 


Glu Val 


Leu 


Lys 


Ser 


Thr 


Met Lys 


Gin 


Glu Leu Glu 


Ala 


Ala 








320 








325 






330 


Gin 


Lys 


Lys 


His 


Ser 


Leu 


Cys 


Glu Leu 


Leu 


Arg lie Pro 


Asn 


He 








335 








340 




345 


Cys 


Lys 


Arg 


He 


Cys 


Phe 


Leu 


Ser Phe 


Val 


Arg Phe Ala 


Ser 










350 








355 




360 


Thr 


He 


Pro 


Phe 


Trp Gly 


Leu 


Thr 


Leu His 


Leu 


Gin His Le;u 


Gly 










365 








370 






375 


Asn 


Asn 


Val 


Phe 


Leu 


Leu 


Gin 


Thr 


Leu Phe 


Gly 


Ala Val Thr 


Leu 










380 








385 




390 
Leu


AjLa. 


Asn 


Cys 


Val Ala 


Pro Trp 


• 








395 . 


Arg 


Leu 


oer 


c»xn 


Met, Leu 'Leui«Met 










410 




Leu 




X le 


xxe 


Pile Val 


Pijo Gin 










425 




. 

Leu 




Thr 


Leu Gly 


Val Gly 










440 




Cys 


Ser 


Thr 


Ala 


Gin Glu 


Asn Glu 










455 






Ai y 


Ala 


Thr Gly lie 


Thr Gly 










470 


A±ci 


Leu 


Ala 


Ser 


Leu I4e1^ 
485 


Met lie 


Leu 


Pro 


. Trp 


lie 


lie Tyr 


Gly Val 










.500 




veil 


vajL 


Leu 


Leu 


Leu Pro 


Glu Thr 






.1 




515 ' 




Ser 


lie 


Gin 


Asp V^l» Glu 


Ash 'gIu 










53 0 




Pro 


Gin 


Aorg 




Ser Val 


Leu 










545 





<210> 30
<211> 547
<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7472734CD1 

<400> 30 



Met 


Gly Phe Asp 


Val 


Leu 


Leu Asp 


1 








5 




Phe 


Gin 


He 


Cys 


Leu 
20 


He 


Ala Phe 


Leu 


Phe 


Pro 


Asn 


He 
35 


Val 


Leu iSlu 


Ser 


His 


Arg 


Cys 


Trp 
50 


Val 


Pro Leu 


Asp> 


Asn Asp 


Thr 


Gly Thr Leu Ser 








65 






Ser 


He 


Pro 


Leu 


Asp 


Ser Asn Leu 










80 






Phe 


He 


His 


Pro 


Gin 


Trp 


Gin Leu 










95 




Pro 


Asn 


Thr 


Asn 


Glu 


Pro 


Asp Thr 










110 




Val 


Tyr 


Asp Arg 


Ser 


Ser 


Phe Leu 










125 






Asp 


Leu 


Val 


Cys 


Glu 
140 


Ser 


Gin Ser 


Leu 


Phe 


Met 


Ala 


Gly 
155 


Ser 


Leu Leu 


Leu 


Ser Asp Arg 


Phe Gly Arg Lys 










170 






Leu 


Gin 


Leu 


Ala 


He 


Val 


Gly Thr 










185 




He 


Leu 


Val 


Tyr 


Cys 


Ser 


Leu Arg 










200 




Phe 


Ser 


He 


He- 


Val 
215 


Asn 


Thr Val 


Thr 


His 


Gin 


Phe 


Cys 
230 


Ala 


Met Ala 


Ala 


Ser 


He Gly 


His 


He 


Thr Leu 










245 







Ala Leu Asn 


His 


Met Ser 


Arg 


400 






405 


Phe Leu 'Ijeu 


Ala 


Thr C^s 


Leu 


415 






420. 


Glu Met Gin 


Thr 


Leu Arg 


Val 


430 




435 


Ala' Ala Ser 


Leu Gly He 


Thr 


445 






450 


Leu He pro 


Ser 


lie He 


Arg 


.460 






465 


Asn Phe Ala Asn He Gly 


Gly 


475 






480 


Leu Ser He 


Tyr 


Ser Arg 


Pro 


• 490 






•495 


Phe Ala He 


Leu 


Ser Gly 


Leu 


505 




510 


Arg Asn Gin 


Pro 


Leu Leu 


Asp 


520 






525 


Gly Val Asn 


Ser 


Leu Ala 


Ala 


535 






540 



I 



Gin 


Val 


Gly Gly Met Gly 


Arg 




10 








15 


Phe 


Cys 


He Thr 


Asn 


He 


Leu 




'25 








30 


Asn 


Phe 


Thr Ala 


Phe 


Thr 


Pro 




40 








45 


Leu Asp Asn Asp Thr Val 


Ser 




55 








60 


Lys 


Asp 


Asp Leu 


Leu 


Arg 


He 




70 








75 


Arg Pro Gin Lys 


cys 


Gin 


Arg 




85 








90 


Leu 


His 


Leu Asn 


Gly 


Thr 


Phe 




100 








105 


Glu 


Pro 


Cys Val 


Asp 


Gly 


Trp 




115 








120 


Ser 


Thr 


He Val 


Thr 


Glu 


Trp 




130 








135 


Leu 


Lys 


Ser Met 


Val 


iGln 


Ser 




145 








150 


Gly Gly. Leu He 


Tyr Gly 


His 




160 








165 


Phe 


Vai 


Leu Arg Trp 


ser 


Tyr 




175 








180 


Cys 


Ala 


Ala Phe 


Ala 


Pro 


Thr 




190 








195 


Phe 


Leu Ala Gly Ala Ala 


Thr 




205 








210 


Leu 


Leu 


lie Val 


Glu 


Trp 


He 




220 






225 


Leu 


Thr 


Leu Thr 


Leu 


Cys 


Ala 




235 






240 


Gly 


Ser 


Leu Ala 


Phe 


Val 


He 




250 








255 
Asp Gln Cys


X xe 


Leu 


XXI 


T .All 


v«x- 


Met Ser Ala Pro Cys 


Phe 






o c n 










265 


270 


Val 


Phe Phe 


Leu 


Phe 
275 


Ser 


Arg 


Trp 


T .oil 


Ala Glti Ser Ala Arg 
280 


Trp 
285 


Leu 


He He. 


Asn 


Asn 


Lys 


Pro 


Glu, 


l^XU 


Gly Le\i Lys Glu Leu Arg 








290 










295 


3 00 


Lys 


AlA Ala 


His 


Arg Asn 


Gly Met 


Xijr o 


Asn Ala Glu Asp He 


Leu 






305 










310 ' ; 


315 




Met Glu 


Val 


Leu 


Lys 


Ser 


Thr 




Lys Gin Glu Leu Glu 


Ala 








320 








325 


330 


Ax a 


Gin Lys 


Lys 


His 


Ser 


Leu 


cys 


V3XU. 


Leu Leu Arg He Pro 


Asn 




335 








1 


340 . . ; 


345 


lie 


Cys Lys 


Arg 


He 


Cy^ 


Phe 


Leu 


Ser 


Phe Val Arg Phe Ala 


Ser 






350- 










355 , t 


360 


Thr 


He Pro 


Phe 


Trp 


Gly Leu 


Thr 


Leu 


His Leu Gin His Leu 


Gly 








365 










370 


375 


Asn 


Asn Val 


Phe 


Leu 
380 


Leu 


Gin 


Thr 


Leu 


Phe Gly Ala Val Thr 

385 ■ 


Leu 
390 


Leu 


Ala Asn 


cys 


Val 


Ala 


Pro 


Trp Ala 


Leu Asn His Met Ser 


Arg 




395 










400 


405 




Leu Ser 


Gin 


Met 


Leu 


Leu 


Met 


Phe 


Leu Leu Ala Thr Cys 


Leu 






410 










415 


420 


Leu 


Ala He 


He 


Phe 


Val 


Pro 


Gin 


Glu 


Met Gin Thr Leu Arg Val 








425 










430 


435 


Val 


Leu Ala 


Thr 


Leu Gly Val Gly Ala Ala Ser Leu Gly He 

440 445 


Thr 








450 


cys 


Ser Thr 


Ala 


Gin 


Glu 


Asn 


Glu 


Leu 


He Pro Ser Hq lie 


Arg 






455 










460' 


465 


Gly 


Arg Ala 


Thr 


Gly 


He 


Thr 


Gly Ash 


Phe Ala Asn He Gly 


Gly 




470 










475 


480 


Ala 


Leu Ala 


Ser 


Leu 
485 


Met 


Met 


He 


Leu 


Ser lie Tyr Ger Arg 
490 


Pro 
495 


Leu 


Pro Trp 


He 


He 


Tyr 


Gly Val 


Phe 


Ala He Leu Ser Gly 


Leu 






500 










505 


510 


Val 


Val Leu 


Leu 


Leu 
515 


Pro 


Glu 


Thr 


Arg 


Asn Gin Pro Leu Leu 
520 


Asp 
525 


Ser 


He Gin 


Asp 


Val 


Glu 


Asn 


Glu 


Gly Val Asn Ser Leu Ala 


Ala 






530 










535 


540 


Pro 


Gin Arg 


Ser 


Ser 


Val 


Leu 














545 















<210> 31 
<211> 988 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7473473CD1

<400> 31 

Met Pro Gly Gly Lys Arg Gly Leu Val Ala Pro Gln Asn Thr Phe
1 5 10 15

Leu Glu Asn Ile Val Arg Arg Ser Ser Glu Ser Ser Phe Leu Leu
20 25 30

Gly Asn Ala Gln Ile Val Asp Trp Pro Val Val Tyr Ser Asn Asp
35 40 45

Gly Phe Cys Lys Leu Ser Gly Tyr His Arg Ala Asp Val Met Gln
50 55 60

Lys Ser Ser Thr Cys Ser Phe Met Tyr Gly Glu Leu Thr Asp Lys
65 70 75

Lys Thr Ile Glu Lys Val Arg Gln Thr Phe Asp Asn Tyr Glu Ser
80 85 90

Asn Cys Phe Glu Val Leu Leu Tyr Lys Lys Asn Arg Thr Pro Val
95 100 105

Trp Phe Tyr Met Gln Ile Ala Pro Ile Arg Asn Glu His Glu Lys
110 115 120 
Val


Val Leu 


Phe 


Leu 


Cys 


Thr 


Phe 


Lys 


Asp 


lie 


Thr 


Leu 


Phe 


Lys 








125 










130 










135 


Gin 


Pro lie 


Glu 


Asp 


Asp 


Ser 


Thr 


Lys 


Gly Trp 


Thr 


Lys 


Phe 


Ala 








140 










145 










150 


Arg 


Leu Thr. Arg 


Ala 


Leu 


Thr 


Asni Ser 


Arg 


Ser 


Val 


Li^u 


Gin 


Gin 








155 










160 








165 


Leu 


Thr Pro 


Met 


Asn 


Lys 


Thr 


Glu 


Val 


Val 


His 


Lys 


His 


Ser. 


Arg 








170 










175 








180 


Leu 


Ala Glu 


Val 


Leu 


Gin 


Leu Gly Ser 


Asp 


He 


£jeu 


Pro 


Gin 


Tyr 








185 










190 










195 


Lys 


Gin Glu 


Ala 


Pro 


Lys 


Thr 


Pro 


Pro 


His 


lie 


He 


Leu 


His 


Tyr 








200 










205 










210 


Cys 


Ala Phe 


Lys 


Thr 


Thr 


Trp Asp 


Trp 


Val 


He 


Leu 


He 


Leu 


Thr 








215 










220 










225 


Phe 


Tyr Thr 


Ala 


He 


Met 


Val 


Pro 


Tyr 


Asn 


Val 


Ser 


Phe 


Lys 


Thr 








230 










235 








240 


Lys 


Gin Asn 


Asn 


He 


Ala 


Trp 


Leu 


Val 


Leu 


Asp 


Ser 


Val 


Val 


Asp 








245 








250 




• 




255 


,Val 


He Phe 


Leu 


Val 


Asp 


He 


Val 


Leu 


Asn 


Phe 


His 


Thr 


Thr 


Phe 


t 






260 










265 










270 


Val 


Gly Pro 


Gly 


Gly 


Glu 


Val 


Ile 


Ser 


Asp 


Pro 


Lys. 


Leu 


lie 


Arg 








.275 










280 










285 


Met 


Asn Tyr 


Leu 


Lys 


Thr 






Val 


lie 


Asp 


Leu 


Leu 


Ser 


Cys 


• 






290 










295 








300 


Leu 


Pro Tyr Asp 


lie 


lie 


Asn 


Ala 


Phe 


Glu 


Asn 


Val 


Asp 


Glu 


Gly 








3 05 










310 










315 


lie 


Ser Ser 


Leu 


Phe 


Ser 


Ser 


Leu 


Lys 


Val . Val 


Arg 


Leu 


Leu 


Arg 








320 










"^ 9 










330 


Leu 


Gly Arg Val 


Ala 


Arg 


Lys- 


Leu 


Asp 


n J. o 


dr/j. 


Leu 


Glu 


Tyr 


Gly 








335 










^ f* u 










345 


Ala 


Ala Val 


Leu 


Val 


Leu 


Leu 


Val 


Cys 


Val 




Gly 


Leu 


Val 


Ala 








350 










J .-J 








360 


His 


Trp Leu 


Ala 


Cys 


lie 


J. 


Tyr 


Ser 


lie 


Glv 


Asp 


Tyr 


Glu 


Val 








3 65 










37 0 










375 


He 


Asp Glu 


Val 


Thr 


Asn 


Thr 


He 


Gin 


lie 


Asp 


Ser 


Trp 


Leu 


Tyr 








380 










385 










390 


Gin 


Leu Ala 


Leu 


Ser 


Ile 


Glv 


Thr 


Pro 


Tyr 


Arg 


Tyr 


Asn 


Thr 


Ser 








395 










400 








405 


Ala 


Gly He 


Trp 


Glu 


Gly 


Glv 


Pro 


Ser 


Lys 


Asp 


Ser 


Leu 


Tyr 


Val 








410 










415 










420 


Ser 


Ser Leu 


Tyr 


Phe 


Thr 


Met 


Thr 


Ser 


Leu 


Thr 


Thr 


Ile 


Gly 


Phe 








425 










430 








435 


Gly 


Asn He 


Ala 


Pro 


Thr 


Thr 


Asp Val 


Glu 


Lys 


Met 


Phe 


Ser 


Val 








440 










445 








450 


Ala 


Met Met 


Met 


Val 


Gly 


Ala 


Leu 


Leu 


Tyr 


Ala 


Thr 


lie 


■Phe 


Gly 








455 










460 










465 


Asn 


Val Thr 


Thr 


lie 


Phe 


Gin 


Gin 


Met 


Tyr 


Ala 


Aiisn 


Thr 


Asn 


Arg 








470 










475 








480 


Tyr 


His Glu Met. 


Leu 


Asn 


Asn 


Val 


Arg 


Asp 


Phe 


Leu 


Lys 


Leu 


Tyr 








485 










490 










495 


Gin 


Val "Pro 


Lys 


Gly 


Leu 


Ser 


Glu Arg 


Val 


Met 


Asp 


Tyr 


lie 


Val 








500 










505 










510 


ser 


Thr Trp 


Ser 


Met 


Ser 


Lys 


Gly 


Ile 


Asp 


Thr 


Glu 


Lys 


Val 


Leu 








515 










520 








525 


Ser 


ile Cys 


Pro 


Lys 


Asp 


Met 


Arg Ala 


Asp 


Ile 


Cys 


Val 


His 


Leu 








530 










535 










540 


Asn 


Arg Lys 


Val 


Phe 


Asn 


Glu 


His 


Pro 


Ala 


Phe 


Arg 


Leu 


Ala 


Ser 








545 










550 








555 


Asp 


Gly Cys 


Leu 


Arg 


Ala 


Leu 


Ala 


Val 


Glu 


Phe 


Gin 


Thr 


Ile 


His 








560 










565 










57 0 


Cys 


Ala Pro 


Gly 


Asp 


Leu 


lie 


Tyr 


His 


Ala 


Gly 


Glu 


Ser 


Val 


Asp 








575 










580 










585 


Ala 


Leu Cys 


Phe 


Val 


Val 


Ser 


Gly 


Ser 


Leu 


Glu 


Val 


He 


Gin 


Asp 








590 










595 










600 


Asp 


Glu Val 


Val 


Ala 


lie 


Leu 


Gly 


Lys 


Gly 


Asp 


Val 


Phe 


Gly 


Asp 








605 










610 










615 


He 


Phe Trp 


Lys 


Glu 


Thr 


Thr 


Leu 


Ala 


His 


Ala 


Cys 


Ala 


Asn 


Val 
620 625 630

Arg Ala Leu Thr Tyr Cys Asp Leu His Ile Ile Lys Arg Glu Ala
635 640 645

Leu Leu Lys Val Leu Asp Phe Tyr Thr Ala Phe Ala Asn Ser Phe
650 655 660

Ser Arg Asn Leu Thr Leu Thr Cys Asn Leu Arg Lys Arg Ile Ile
665 670 675

Phe Arg Lys Ile Ser Asp Val Lys Lys Glu Glu Glu Glu Arg Leu
680 685 690

Arg Gln Lys Asn Glu Val Thr Leu Ser Ile Pro Val Asp His Pro
695 700 705

Val Arg Lys Leu Phe Gln Lys Phe Lys Gln Gln Lys Glu Leu Arg
710 715 720

Asn Gln Gly Ser Thr Gln Gly Asp Pro Glu Arg Asn Gln Leu Gln
725 730 735

Val Glu Ser Arg Ser Leu Gln Asn Gly Ala Ser Ile Thr Gly Thr
740 745 750

Ser Val Val Thr Val Ser Gln Ile Thr Pro Ile Gln Thr Ser Leu
755 760 765

Ala Tyr Val Lys Thr Ser Glu Ser Leu Lys Gln Asn Asn Arg Asp
770 775 780

Ala Met Glu Leu Lys Pro Asn Gly Gly Ala Asp Gln Lys Cys Leu
785 790 795

Lys Val Asn Ser Pro He Arg Met Lys Asn Gly Asn Gly L^s Gly 

800 805 810 

Trp Leu Arg Leu Lys Asn Asn Met Gly Ala His Glu Glu Lys Lys 

815 , » V 820 825 

Glu Asp Trp Asn Asn Val Thr Lys Ala Glu Ser Met Gly ' Leu Leu 
.830, 835 840 

Ser Glu Asp Pro Lys Ser Ser Asp Ser Glu Asn Ser Val .Thr Lys 

845 ' 850 855 

Asn Pro Leu Arg Lys Thr Asp Ser Cys Asp Ser Gly He Thr Lys 

860 865 870 

Ser Asp Leu Arg Leu Asp Lys Ala Gly Glu Ala Arg Ser Pro Leu 

875 880 885 

Glu His Ser Pro He Gin Ala Asp Ala Lys His Pro Phe Tyx Pro , 

890 8*95 900 

He Pro Glu Gin Ala Leu Gin Thr Thr Leu Gin Glu Val Lys His 

i905 910 915 

Glu Leu Lys Glu Asp He Gin Leu Leu Ser Cys Arg Met Thr Ala 

920 925 930 

Leu Glu Lys Gin Val Ala Glu He Leu Lys lie Leu Ser Glu Lys 
. 935 940 , 945 

Ser Val Pro Gin Ala Ser Ser Pro Lys Ser Gin Met Pro Leu Gin 

950 ' 955 960 

Val Pro Pro Gin He Pro Cys Gin Asp He Phe Ser Val Ser Arg 

965 970 975 

Pro Glu Ser Pro Glu Ser Asp Lys Asp Glu He His Pile 

980 985 

<210> 32

<211> 533

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7477725CD1 

<400> 32 

Met Ala Phe Glu Glu Leu Leu Ser Gln Val Gly Gly Leu Gly Arg
1 5 10 15

Phe Gln Met Leu His Leu Val Phe Ile Leu Pro Ser Leu Met Leu

20 25 30

Leu Ile Pro His Ile Leu Leu Glu Asn Phe Ala Ala Ala Ile Pro

35 40 45

Gly His Arg Cys Trp Val His Met Leu Asp Asn Asn Thr Gly Ser

50 55 60

Glu Gly Asn Thr Gly Ile Leu Ser Glu Asp Ala Leu Leu Arg Ile

65 70 75

Ser Ile Pro Leu Asp Ser Asn Leu Arg Pro Glu Lys Cys Arg Arg

80 85 90

Phe Val His Pro Gln Trp Gln Leu Leu His Leu Asn Gly Thr Ile

95 100 105

His Ser Thr Ser Glu Ala Asp Thr Glu Pro Cys Val Asp Gly Trp

110 115 120

Val Tyr Asp Gln Ser Tyr Phe Pro Ser Thr Ile Val Thr Lys Trp

125 130 135

Asp Leu Val Cys Asp Tyr Gln Ser Leu Lys Ser Val Val Gln Phe

140 145 150

Leu Leu Leu Thr Gly Met Leu Val Gly Gly Ile Ile Gly Gly His

155 160 165

Val Ser Asp Arg Phe Gly Arg Arg Phe Ile Leu Arg Trp Cys Leu

170 175 180

Leu Gln Leu Ala Ile Thr Asp Thr Cys Ala Ala Phe Ala Pro Thr

185 190 195

Phe Pro Val Tyr Cys Val Leu Arg Phe Leu Ala Gly Phe Ser Ser

200 205 210

Met Ile Ile Ile Ser Asn Asn Ser Leu Pro Ile Thr Glu Trp Ile

215 220 225

Arg Pro Asn Ser Lys Ala Leu Val [illegible] Ile Leu Ser Ser Gly Ala

230 235 240

Leu Ser Ile Gly Gln Ile Ile Leu Gly Gly Leu Ala [illegible]

245 250 255

Arg Asp Trp Gln Thr Leu His Val Val Ala Ser Val Pro Phe Phe

260 265 270

Val Phe Phe Leu Leu Ser Arg Trp Leu Val Glu Ser Ala Arg Trp

275 280 285

Leu Ile Ile Thr Asn Lys Leu Asp Glu Gly Leu Lys Ala Leu Arg

290 295 300

Lys Val Ala Arg Thr Asn Gly Ile Lys Asn Ala Glu Glu Thr Leu

305 310 315

Asn Ile Glu Val Val Arg Ser Thr Met Gln Glu Glu Leu Asp Ala

320 325 330

Ala Gln Thr Lys Thr Thr Val Cys Asp Leu Phe Arg Asn Pro Ser

335 340 345

Met Arg Lys Arg Ile Cys Ile Leu Val Phe Leu Arg Phe Ala Asn

350 355 360

Thr Ile Pro Phe Tyr Gly Thr Met Val Asn Leu Gln His Val Gly

365 370 375

Ser Asn Ile Phe Leu Leu Gln Val Leu Tyr Gly Ala Val Ala Leu

380 385 390

Val Ile Arg Cys Leu Ala Leu Leu Thr Leu Asn His Met Gly Arg

395 400 405

Arg Ile Ser Gln Ile Leu Phe Met Phe Leu Val Gly Leu Ser Ile

410 415 420

Leu Ala Asn Thr Phe Val Pro Lys Glu Met Gln Thr Leu Arg Val

425 430 435

Ala Leu Ala Cys Leu Gly Ile Gly Cys Ser Ala Ala Thr Phe Ser

440 445 450

Ser Val Ala Val His Phe Ile Glu Leu Ile Pro Thr Val Leu Arg

455 460 465

Ala Arg Ala Ser Gly Ile Asp Leu Thr Ala Ser Arg Ile Gly Ala

470 475 480

Ala Leu Ala Pro Leu Leu Met Thr Leu Thr Val Phe Phe Thr Thr

485 490 495

Leu Pro Trp Ile Ile Tyr Gly Ile Phe Pro Ile Ile Gly Gly Leu

500 505 510

Ile Val Phe Leu Leu Pro Glu Thr Lys Asn Leu Pro Leu Pro Asp

515 520 525

Thr Ile Lys Asp Val Glu Asn Gln

530














<210> 33 
<211> 1775

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 3474673CB1 



<400> 33 
atttcaggaa atgtgagggg gctctgggcc ccttccctca gcgcctgcgg tcacccagca 60
gcttcctcct tctccctggc ctaggcctag caggtgggca ccccgcacac atttgaggcg 120
gggccagatg cccacagttc agagcctctt tjttgtcccgg ggattggatc ccagggctgg 180
gtggggccag gctgtcccat t^ccccaacac tcctcctccc cggcgaaadc gggcaccagc 240
aggcgtttgc gagaggagat acgagctgga cgcctggqcc ttcccticcca ccgggtccta 300
gtccaccgct cccggcgccg gctccccgcc tctcccgcta tgtaccgacc gcgagcccgg 360
gcggctcccg agggcagggt ccggggctgc gcggtgccca gcaccgtgct cctgctgctc 420
gcctacctgg cttacctggc gctgggcacc ggcgtgttct ggacgctgga gggccgcgcg 480
gcgcaggact ccagccgcag cttdcagcgc gacaagtggg agctgttgca gaacttcacg 540
tgtctggacc gcccggcgct ggactcgctg atccgggatg tcgtccaagc atacaaaaac 600
^g^gccagcc tcctcagcaa caccaqcagc atggggcgct gggagctcgt gggctccttc 660
ttcttttctg tgtccaccat caccaccatt ggctatggca acctgagccc caacacgatg 720
gctgcccgcc tcttctgcat cttctttgcc cttgtgggga tcccactcaa cctcgtggtg 780
ctcaaccgac tggggcatct catgcagcag ggagtaaacc actgggccag caggctgggg 840
ggcacctggc aggatcctga caaggcgcgg tggctggcgg gctctggcgc cctcctctcg 900
ggcctcctgc tcttcctgct gctgccaccg ctgctcttct cccacatgga gggctggagc 960
tacacagagg gcttctactt cgccttcatc accctcagca ccgtgggctt cggcgactac 1020
gtgattggaa tgaacccctc ccagaggtac ccactgtggt acaagaacat ggtgtccctg 1080
tggatcctct ttgggatggc atggctggcc ttgatcatca aactcatcct ctcccagctg 1140
gagacgccag ggagggtatg ttcctgctgc caccacagct ctaaggaaga cttcaagtcc 1200
caaagctgga gacagggacc tgaccgggag ccagagtccc actccccaca gcaaggatgc 1260
tatccagagg gacccatggg aatcatacag catctggaac cttctgctca cgctgcaggc 1320
tgtggcaagg acagctagtt atactccatt ctttggtcgt cgtcctcggt agcaagaccc 1380
ctgattttaa gctttgcaca tgtccaccca aactaaagac tacattttcc atccacccta 1440
gaggctgggt gcagctatat gattaattct gcccaatagg gtatacagag acatgtcctg 1500
ggtgacatgg gatgtgactt tcgggtgtcg gggcagcatg cccttctccc ccacttcctt 1560
actttagcgg gctgcaatgc cgccgatatg atggctggga gctctggcag ccatacggca 1620
ccatgaagta gcggcaatgt ttgagcggca caataagata ggaagagtct ggatctctga 1680
tgatcacaga gccatcctaa caaacggaat atcacccgac ctcctttatg tgagagagaa 1740
ataaacatct tatgtaaaat accaaaaaaa aaaaa 1775

<210> 34

<211> 1545

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 4588877CB1

<400> 34

aatgagggcc ctgggggttg ggcccaggag tggggctgtg gtggtgagtg gacagggctg 60
ggctggaaat gtcccctgag tgccccctct cacctcaggc tatggcaacc tgagccccaa 120
cacgatggct gcccgcctct tctgcatctt ctttgccctt gtggggatcc cactcaacct 180
cgtggtgctc aaccgactgg ggcatctcat gcagcaggga gtaaaccact gggccagcag 240
gctggggggc acctggcagg tgagggggct gctggacggg gtggggatgg gtcacttcta 300
gaatgagggg ctgtggtggg aattggggtt actaatgaca agaggtggga gcaagtgtta 360
ctggtgaggt tgtgttggga ttgggggtca ctgctcagaa tagggtcctt agtgaaaaag 420
ggcattaatg gtggagatgg ggtgggactg ggcagacagg aaggacatga ggcacaggct 480
ccaggcaggg aacctggaga acacagacca ggtgaagagc ccccttctta ctggggacag 540
ctctggcctg cctccagctc cctcggctcc cacgcatggg gtgaaggcct caggaggcct 600
ggggacaata ttgcacccac aggatcctga caaggcgcgg tggctggcgg gctctggcgc 660
cctcctctcg ggcctcctgc tcttcctgct gctgccaccg ctgctcttct cccacatgga 720
gggctggagc tacacagagg gcttctactt cgccttcatc accctcagca ccgtgggctt 780
cggcgactac gtgattggaa tgaacccctc ccagaggtac ccactgtggt acaagaacat 840
ggtgtccctg tggatcctct ttgggatggc atggctggcc ttgatcatca aactcatcct 900
ctcccagctg gagacgccag ggagggtatg ttcctgctgc caccacagct ctaaggaaga 960
cttcaagtcc caaagctgga gacagggacc tgaccgggag ccagagtccc actccccaca 1020
gcaaggatgc tatccagagg gacccatggg aatcatacag catctggaac attctgctca 1080
cgctgcaggc tgtggcaagg acagctagtt atactccatt ctttggtcgt cgtcctcggt 1140
agcaagaccc ctgattttaa gctttgcaca tgtccaccca aactaaagac tacattttcc 1200
atccacccta gaggctgggt gcagctatat gattaattct gcccaatagg gtatacagag 1260
acatgtcctg ggtgacatgg gatgtgactt tcgggtgtcg gggcagcatg cccttctccc 1320
ccacttcctt actttagcgg gctgcaatgc cgccgatatg atggctggga gctctggcag 1380
ccatacggca ccatgaagta gcggcaatgt ttgagcggca caataagata ggaagagtct 1440
ggatctctga tgatcacaga gccatcctaa caaacggaat atcacccgac ctcctttatg 1500
tgagagagaa ataaacatct tatgtaaaat accaaaaaaa aaaaa 1545



<210> 35
<211> 1941
<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7472214CB1



<400> 35 
atggcggaga aggcgctgga ggccgtgggc tgtggactag ggccgggggc tgtggccatg 60
gccgtgacgc tggaggacgg ggcggaaccc cctgtgctga ccacgcacct gaagaaggtg 120
gagaaccaca tcactgaagc ccagcgcttc tcccacctac ccaagcgctc agccgtggac 180
atcgagttcg tggagctgtc ctattccgtg cgggaggggc cctgctggc^ caaaaggggt 240
tataagaccc ttctcaagtg cctctcaggt aaattctgcc gccgggagct gattggcatc 300
atgggcccct caggggctgg caagtctaca ttcatgaaca tcttggcagg atacagggag 360
tctggaatga aggggcagat cc^g^ttaat ggaaggccac gggagctgag gaccttccgc 420
aagatgtcct gctacatcat gcaagatgac atgctgctgc cgcaccfcac ggtgttggaa 480
gccatgatgg tgtctgctaa cctgaatctt actgagaatc ccgatgtgaa aaacgatctc 540
gtgacagaga tcctgacggc actgggcctg atgtcgtgct cccacacgag gacagccctg 600
ctctctggcg ggcagaggaa gcgtctggcc atcgccctgg agctggtcaa caacccgcct 660
gtcatgttct ttgatgagcc caccagtggt ctggatagcg cctcttgttt ccaagtggtg 720
tccctcatga agtccctggc acaggggggc cgtaccatca tctgcaccat ccaccagccc 780
agtgccciagc tctttgagat gtttgacaag ctctacatcc tgagccaggg tcagtgcatc 840
ttcaaaggcg tggtcaccaa cctgatcccc tatctaaagg gactcggctt gcattgcccc 900
acctaccaca acccggctga cttcgtcatc gaggtggcct ctggcgagta tggagacctg 960
aaccccatgt tgttcagggc tgtgcagaat gggcttgtgcg ctatggctga gaagaagagc 1020
agccctgaga agaacgaggt ccctgcccca tgccctcctt gtcctccgga agtggatccc 1080
attgaaagcc acacctttgc caccagcacc ctcacacagt tctgcatcct cttcaagagg 1140
accttcctgt ccatcctcag ggacacggtg ctgacccacc tacggttcat gtcccacgtg 1200
gttattggcg tgctcatcgg cctcctctac ctgcatattg gcgacgat^c cagcaaggtc 1260
ttcaacaaca ccggctgcct cttcttctcc atgctgttcc tcatgttcgc cgccctcatg 1320
ccaactgtgc tcaccgtccc cttagagatg gcggtcttca tgagggagca cctcaactac 1380
tggtacagcc tcaaagcgta ttacctggcc aagaccatgg ctgacgtgcc ctttcaggtg 1440
gtgtgtccgg tggtctactg cagcattgtg tactggatga cgggccagcc cgctgagacc 1500
agccgcttcc tgctcttctc agccctggcc accgccaccg ccttggtggc ccaatctttg 1560
gggctgctga tcggagctgc ttccaactcc ctacaggtgg ccacttttgt gggcccagtt 1620
accgccatcc ctgtcctctt gttctccggc ttctttgtca gcttcaagac catccccact 1680
tacctgcaat ggagctccta tctctcctat gtcaggtatg gctttgaggg tgtgatcctg 1740
acgatctatg gcatggagcg aggagacctg acatgtttag aggaacgctg cccgttccgg 1800
gagccacaga gcatcctccg agcgctggat gtg^aggatg ccaagctcta catggacttc 1860
ctggtcttgg gcatcttctt cctagccctg cggctgctgg cctaccttgt gctgcgttac 1920
cgggtcaagt cagagagata g 1941



<210> 36 
<211> 4971 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7473053CB1 

<400> 36 

caaagtagcg ggccgaggcc cgggggagcg gggccgcagc tgggggggcg ggagcccgtg 60 
gggagccgag ccgagcgccc cccgccccag cccccggcat gggcagtacg gggccgccgg 120 
ggcgggcgcc gagcgctgag cgctgagggt ctcccatggg attgctggga tcttgctggg 180 
tgagatggca gtgtgtgcaa aaaagcgccc cccagaagaa gaaaggaggg cgcgggctaa 240
tgaccgagaa tacaatgaga aattccagta tgcgagtaac tgcatcaaga cctccaagta 300
caatattctc accttcctgc ctgtcaacct ctttgagcag ttccaggaag ttgccaacac 360
ttacttcctg ttcctcctca ttctgcagtt gatcccccag atctcttccc tgtcctggtt 420 
caccaccatt gtgcctttgg ttcttgtcc^ caccatcaca gqtgttaaag atgccactga 480 
tgactatttc cgccacaaga gcgataacca ggtgaataac cgcca^tctc aggtgctgat 540 
caacggaatc ctccagcagg agcagtggat gaatgtctgt gttggtgata ttatcaagct 600
agaaaataac cagtttgtgg cggcggatct cctcctcctt t^cagcagtg agccccatgg 660
gctgtgttac atagagacag cagaacttga tggcgagacc aacatgaaag tacgtcaggc 720 
gattccagtc acctcagaat tgggagacat cagtaagctt gccaagtttg acggtgaagt 780 
gatctgtgaa cctcccaaca acaaactcfga caaattcagc ggaaccctct actggaagga 840 
aaataagttc cctctgagca acca:gaacat gptgctgcgg ggctgtgtgc tgcgaaacac 900 
cgagtggtgc ttcgggctgg tcatctttgc aggtcccgac actaagct^a tgcaaaacag 960
cggcagaaca aagttcaaaa gaacgagtat cgatcgccpta atgaataccc tggtgctctg 1020 
gatttttgga ttcctggttt gcatgggggt gatcctggcc attggcaatg ccatctggga 1080
gcacgaggtg gggatgcgtt tccaggtcta cctgccgtgg gatgaggcag tggacagtgq 1140 
cttcttctct ggcttcctct ccttctggtc ctacatcatc atcctcaaca ccgttgtgcc 1200 
catttcactc tatgtcagtg tggdggtcat ccgtctgggc cacagctact tcatcaactg 1260
ggataagaag atgttctgca tgaagaagcg gacgcctgca gaagcccgca ccaccaccct 1320 
aa^acgaggag ctgggccagg tggagt^acat cttctccgac aagacgggca ccctcacccai 1380
gaacatcatg gttttcaaca agtgctccat caatggccac agctatggtg atgtgtttga 1440
cgtcctggga cacaaagctg aattgggaga gaggcctgaa cctgttgact tctccttcaa 1500 
tcctctggct gacaagaagt tcttattttg ggacGccagc ctgctggagg ctgtcaagat 1560 
cggggacccc cacacgcatg agttcttccg cctcctttcc ctgtgtcata ctgtcatgtc 1620
agaagaaaag aacgaaggag agctgtacta caaagctcag tccccagatg agggggccct 1680 
ggtcaccgca gccaggaact ttggttttgt tttccgctct cgcaccccca aaacaatcac 1740 
cgtccatgag atgggcacag ccatcaccta ccagctgctg gccatcctgg acttcaacaa 1800 
catccgcaag cggatgtcgg tcatagtgcg gaatccagag gggaagatcc gactctactg 1860 
caaaggggct gacactatcc tactggacag actgcaccac tccactcaag agctgctcaa 1920 
caccaccatg gaccacctta atgagtacgc aggggaaggg ctgaggaccc tggtgctggc 1980
ctacaaggat ctggatgaag agtactatga ggagtgggct gagcgacgcc tccaggccag 2040 
cctggcccag gacagccggg aggacaggct ggctagcatc tatgaggagg ttgagaacaa 2100 
catgatgctg ctgggtgcaa cggccattga ggacaaactt cagcaagggg ttccagagac 2160 
cattgccctc ctgacactgg ccaacatcaa gatttgggtg ctaaccggag acaagcaaga 2220
gacggctgtg aacatcggct attcctgcaa gatgctgacg gatgacatga dtgaggtttt 2280 
catagtcact ggccatactg tcctggaggt gcgggaggag ctcaggaaag cccgggagaa 2340 
gatgatggac tcatcccgct ctgtaggcaa cggcttcacc tatcaggaca agctttcttc 2400 
ttccaagcta acttctgtcc tggaggccgt tgctggggag tacgccctgg tcataaatgg 2460 
tcacagcctg gcccacgcac tggaggcaga catggagctg gagtttctgg agacagcgtg 2520 
tgcctgcaaa gctgtcatct gctgccgggt gacccccttg cagaaggcac aggtggtaga 2580 
actggtcaag aagtacaaga aggctgtgac gcttgccatt ggagacggag ccaatgatgt 2640 
cagcatgatc aaaacggctc acattggtgt ggggatcagt gggcaggaag ggatccaggc 2700 
tgtcttggcc tccgattact ccttctccca gttcaagttc ctgcagcgcc tcctgctggt 2760 
gcatgggcgc tggtcctacc tgcgaatgtg caagtttctt tgctatttct tctacaaaaa 2820 
ctttgctttc accatggtcc acttctggtt tggcttcttc tgtggcttct cagcccagac 2880 
cgtctatgac cagtatttca tcaccctgta taacatcgtg tacacctccc tgccagtcct 2940 
ggctatgggg gtctttgatc aggatgtccc cgagcagcgg a^catggagt accctaagct 3000 
gtatgagccg ggccagctga accttctctt caacaagcgg gagttcttca tctgcatcgc 3060
ccagggcatc tacacctccg tgctcatgtt cttcattccc tatggggtgt ttgctgatgc 3120 
cacccgggat gatggcactc agctggctga ctaccagtcc tttgcagtca ctgtggccac 3180 
atccttggtc attgtggtta gcgtgcagat tgggctcgac acaggctact ggacggccat 3240 
caaccacttc ttcatctggg gaagccttgc tgtttacttt gccatcctct ttgccatgca 3300 
cagcaatggg ctcttcgaca tgtttcccaa ccagttccgg tttgtgggga atgcccagaa 3360 
caccttggcc cagcccacgg tgtggctgac cattgtgctc accacagtcg tctgcatcat 3420 
gcccgtggtt gccttccgat tcctcaggct caacctgaag ccggatctct ccgacacggt 3480 
ccgctacaca cagctcgtga ggaagaagca gaaggcccag caccgctgca tgcggcgggt 3540 
tggccgcact ggctcccggc gctccggcta tgccttctcc catcaggagg gcttcgggga 3600
gctcatcatg tctggcaaga acatgcggct gagctctctc gcgctctcca gcttcaccac 3660 
ccgctccagc tccagctgga ttgagagcct gcgcaggaag aagagtgaca gtgccagtag 3720 
ccccagtggc ggtgccgaca agcccctcaa gggctgaagg ccgaggatgg atgccctgtg 3780 
ccagtgacca gagcacccag ggctggccag tcactgaggg aacagcgtct cggaactgct 3840 
ggtcctcatt ccttgcttcc cgtccccccg gtagactctg tcctgctggt cccaccacac 3900
atggctggga catctgttcc cagctgtagg cccttccacc agctggggag ctagagggag 3960 
caggcccaag ggcagagcag aggctgaggc acggggagcc agccccactc ggggaccaga 4020 
agtggaacca aaaacaagaa aaaactgtga gagattgtgt ctgcccctgc cctgcctggg 4080 
acccacaggg agactataat ctccttattt ttttactcct actccccaga ggggccctag 4140 
tgcctctgtt cctgaattac ataagaatgt accatgccgg gaagccagag acctgcaggg 4200 
gcctcggccc ctcacatcgt gtatgtctct ccttgatttg tgttgtgtcc agtttggttt 4260
tgtctttttt tatttggcaa gtggaggagg cttttatgtg acttttatgt tgtggttggt 4320
gtcttaaqtc tcctgggaaa aggaggctgg cacacactgg gatgccgcag cctggccggc 4380
tgtggggtgg tttgggagga tccatgtcgg ctctgcctgc agtgaccagt gctctgtggg 4440
gcagaggagc tgaccaggga gggaggtacc catgagcagjEi gggtagtggg agagtgtaaa 4500
ggagggtttg gtcctgtctg cttcctcacc ttgagagtaa agtgctgcrcc tctgccccca 4560
acacac&cac atatcaattc ctggattcct tagtcctgct g^ccttgggc tggagcctag 4620
gaaagtggcc cccaaatcct tagtgagcta aagctgggtc tgaa^tttgg tcagtgggga 4680
ggggtagttt tcttttcttt tttctttttc tttttttctt tttttttttg agatggagtc 4740
tcactcttgt cacctaggca agagtgcaat ggcacaatct cagctcactg caacctccac 4800
ctcctgggtt caagcgattc tcctgcctcc ccggacccaa ccactggact taatctcact 4860
ttcttaaatt cttctattct cagacacggg tctagtacca ttccttcctc ttagccccag 4920
ggagcaaatt aaagaggtta cgagttaaaa tcctaaaaaa aaaaaaaaaa a 4971

<210> 37
<211> 1404
<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 7473347CB1

<400> 37
atggtcctgg ctttccagtt agtctccttc acctacatct ggatcatatt gaaaccaaat 60
gtttgtgctg cttctaacat caagatgaca caccagcggt gctcctcttc aatgaaacaa 120
acctgcaaac aagaaactag aatgaagaaa gatgacagta ccaaagcgcg gcctcagaaa 180
tatgagcaac ttctccatat sgaggacaac gatttcgcaa tgagacctgg atttggaggg 240
tctccagtgc cagtaggtat agatgtccat gttgaaagca ttgacagcat ttcagagact 300
aacatggact ttacaatgac tttttatctc aggcattact ggaaagacga gaggctctcc 360
tttcctagca cagcaaacaa aagcatgaca tttgatcata gattgaccag aaagatctgg 420
gtgcctgata tcttttttgt ccactctaaa agatccttca tccatgatac aactatggag 480
aatatcatgc tgcgcgtaca ccctgatgga aacgtcctcc taagtctcag gataacggtt 540
tcggccatgt gctttatgga tttcagcagg tttcctcttg acactcaaaa ttgttctctt 600
gaactggaaa gctatgccta caatgaggat gacctaatgc tatactggaa acacggaaac 660
aagtccttaa atactgaaga acatatgtcc ctttctcagt tcttcattga agacttcagt 720
gcatctagtg gattagcttt ctatagcagc acaggctggt acaataggct tttcatcatc 780
tctgtgctaa ggaggcatgt tttcttcttt gtgctgccaa cctattaccc agccatattg 840
atggtgatgc tttcatgggt ttcattttgg attgaccgaa gagctgttcc tgcaagagtt 900
tccctgggaa tcaccacagt gctgaccatg tccacaatca tcactgctgt gagcgcctcc 960
atgccccagg tgtcctacct caaggctgtg gatgtgtacc tgtgggtcag ctccctcttt 1020
gtgttcctgt cagtcattga gtatgcagct gtgaactacc tcaccacagt ggaagagcgg 1080
aaacaattca agaagacagg aaagatttct aggatgtaca atattgatgc agttcaagct 1140
atggcctttg atggttgtta ccatgacagc gagattgaca tggaccagac ttccctctct 1200
ctaaactcag aagacttcat gagaagaaaa tcgatatgca gccccagcac cgattcatct 1260
cggataaaga gaagaaaatc cctaggagga catgttggta gaatcattct ggaaaacaac 1320
catgtcattg acacctattc taggatttta ttccccattg tp^tatatttt atttaatttg 1380
ttttactggg gtgtatatgt atga 1404

<210> 38

<211> 4048

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7474240CB1



<400> 38 

cttccatccc ccctcagcca ttccttactg ctctgggcaa ccgccaggtt aagcccattt 60
gcactgggaa attggcgctg tttgggagaa gagaaacaga tcgattgccc ttgtgactcc 120
ccgccccctt cccatcccca cccccaccgc tctctccctc tttccctccc ccgccacctc 180
ccctcacccc gcctccttcc cgttccccac ccccaaaccc tctcacccgc ggcagtccgg 240
tgcgaggccc cctccggaag gtgaggggaa tggattggac tccggtggag aaagcgggtg 300
tctagaagtg gtgctaatgg gaagagaatt ctggtttcaa aagaggatgc tctgccacaa 360
agagcggctc gcgcgctggc ctgggctcta gccgaggaga gatcccggga ggactccaga 420
gctccggggg agcgctcctc ggaagaccgg ggccaacatg cctgtgcgca gggggcatgt 480
ggcaccacaa aatacatttc tggggaccat cattcggaaa tttgaagggc aaaataaaaa 540
atttatcatt gcaaatgcca gagtgcagaa ctgtgccatc atttattgca acgatgggtt 600
ctgtgagatg actggtttp^t ccaggKDcaga tgtcatgcaa aagccatgca cctgcgactt 660
tctccatgga cccgagacca agaggcatga tattgcccaa attgcccagg cattgctggg 720
gtcagaagag aggaaagtgg aggtcaccta ctatcacaaa aatgggtcca cttttatttgr 780
taacactcac ataattccag tga^aaacca agagggcgtg ^ctatgatgt tcatcattaa 840
ttttgaatat gtgacggata atgaaaacgc tgctacccca gagagggtaa acccaatatt 900
accaatcaaa actgtaaacc ggaaattttt tgggttcaaa ttccctggtc tgagagttct 960
cacttacaga aagcagtcct taccacaaga agaccccgat gtggtggtca tcgattcatc 1020
taaacacagt gatgattcag tagccatgaa gcattttaag tctcctacaa aagaaagctg 1080
cagcccctct gaagcagatg acacaaaagc tttgatacag cccagcaaat gttctccctt 1140
ggtgaatata tccggacctc ttgaccattc ctctcccaaa aggcaatggg accgactcta 1200
ccctgacatg ctgcagtcaa gttcccagct gtcccattc9 agatcaaggg aaagcttatg 1260
tagtatacgg agagcatctt cggtjccatga tatagaagga ttcggcgtcc accccaagaa 1320
catatttaga gaccgacatg ccagcgaaga caatggtcgc aatgtcaaag ggccttttaa 1380
tcatatcaag tcaagcctcc tgggatccac atcagattca aacctcaaca aatacagcac 1440
cattaacaag attccacagc tcactctgaa tttttcagag gtcaaaactg agaaaaagaa 1500
ttcatcacct ccttcttcag ataaaaccat tattgcaccc aaggttaaag atcgaacaca 1560
caatgtgact gagaaa^tga cccag^ttct ctctttagga gcagatgtcc tacctgaata 1620
caaactgcag acaccacgca tcaacaagtt tacgatattg Gactacagcc ctttcaaggc 1680
agtctgggac tggcttatcc tgctgttggt catatacact gctatattta ctcGctactc 1740
tgcagccttc ctcctcaatg acagagaaga acagaaaaga cgagaatgtg gctattcttg 1800
tagccctttg aatgtggtag acttgattgt ggatattatg tttatcatag atattttaat 1860
aaacttcaga acaacatatg taaatcagaa tgaagaagtg gtaagtgat^ ccgccaaaat 1920
agcaatacac tacttcaaag gctggttcct gattgacatg gttgcagcaa ttccttttga 1980
cttgctgatt tttggatcag gttctgatga gacaacaaca ttaattggtc ttttgaagac 2040
tgcccgactc ctccgtcttg tgcgagtggc caggaaactg gatcgatatt cagaatatgg 2100
cgctgctgtt ctaatgctct taatgtgcat ctttgccctg attgctcact ggctggcttg 2160
catttggtat gc^attggga atgtagaaag gccttacctg actgacaaaa tcggatggtt 2220
ggattcctta ggacagcaaa ttgggaaacg ttacaatgac agtgactqaa gttctggacc 2280
atccattaaa gacaaatacg tcacagcact ttattttacc ttcagcagtt taaccagtgt 2340
aggattcggg aatgtgtctc ctaacacgaa ttcggagaaa atcttttcaa tttgtgtcat 2400
gttgattggc tcactaatgt atgcaagcat ttttgggaat gtatctgcaa ttatccaaag 2460
actatactcg ggaactgcca ggtaccacat gcagatgctg cgagtaaaag agttcattcg 2520
ctttcaccaa atccccaacc ctctgaggca acgtcttgaa gaatatttcc agcacgcatg 2580
gacttacacc aatggcattg acatgaacat ggtcctaa^g ggtttcccag aatgcttaca 2640
agcagacatt tgtctacatc tcaaccagac attgctgcaa aactgcaaag cctttbgggg 2700
ggcaagtaaa ggttgcctta gagctttggc aatgaagttc aaaaccaccc atgcacctcc 2760
^?0^?^cacc ctcgttcact gtggggatgt cctcactgca ctttatttct tatccagagg 2820
ctccattgaa attctcaaag atgacattgt ggtggctatt ctgggaaaaa atgatatatt 2880
tggagaaatg gttcatcttt atgccaaacc tggaaagtct aatgcagatg taagagccct 2940
cacatactgt gacttgcata agattcagcg agaagacttg ttagaggttt tggatatgta 3000
tcctgagttt tctgatcact ttctaacaaa cctagagttg actttcaacc taaggcatga 3060
gagcgcaaag gctgatctcc tacgatcaca atccatgaat gattcagaag gagacaactg 3120
taaactaaga agaaggaaat tgtcatttga aagtgaagga gagaaagaaa acagtacaaa 3180
tgatcctgaa gactctgcag ataccataag acattatcag agttccaaga gacactttga 3240
agagaaaaaa agcagatcct catctttcat ctcctccatt gatgatgaac aaaagccgct 3300
cttctcagga atagtagact cttctccagg aatagggaaa gcatctgggc tcgattttga 3360
agaaacagtg cccacctcag gaagaatgca catagataaa agaagtcact cttgcaaaga 3420
tatcactgac atgcgaagct gggaacgaga aaatgcacat ccccagcctg aagactccag 3480
tccatctgca cttcagcgag ctgcctgggg tatctctgaa accg^aagcg acctcaccta 3540
cggggaagtg gaacaaagat tagatctgct ccaggagcaa cttaacaggc ttgaatccca 3600
aatgaccact gacatccaga ccatcttaca gttgctgcag aaacaaacca ctgtggtccc 3660
cccagcctac agtatggtaa cagcaggatc agaatatcag agacccatca tccagctgat 3720
gagaaccagt caaccggaag catccatcaa aactgaccga agtttcagcc cttcctcaca 3780
atgtcctgaa tttctagacc ttgaaaaatc taaacttaaa tccaaagaat ccctttcaag 3840
tggggtgcat ctgaacacag cttcagaaga caacttgact tcacttttaa aacaagacag 3900
tgatctctct ttagagcttc acctgcggca aagaaaaact tacgttcatc caattaggca 3960
tccttctttg ccagattcat ccctaagcac tgtaggaatc Srtgggtcttc ataggcatgt 4020
ttctgatcct ggtcttccag ggaaataa 4048



<210> 39 

<211> 1539

<212> DNA

<213> Homo sapiens

<220>
<221> misc_feature

<223> Incyte ID No: 7475338CB1

<400> 39 

^tggagaaca aagaggpggg aacc^cctcca qccattccat qcagggaggg gcggc^ccag 60
ccgacgctgt tgctggcgac actgagcgcg gcctttggct cagccttcca gtacggctac 120 
aacctctctg fcggtcaacac gccgcacaag gt^ttcaagt cattttacaa cgaaacctac 180 
tttgagcgac acgcaacatt catggacggg aagctcatgc tgcttctatg gtcttgcacc 240 
gtctccatgt ttcctctggg cggcctgttg gggtcattgc tcgtgggcct gctggttgat 300
agctgcggca ggaaggggac cctgctgatc aacaacatct ttgccatcat ccccgccatc 360
ctgatgggag tcagcaaagt ggccaaggct tttgagctga tcgtcttttc ccgagtggtg 420
ctgggagtct gtgcaggtat ctcctacagc gcccttccca tgtacctggg agaactggcp 480
cccaagaacc tgagaggcat ggtgggaaca atgaccgagg ttttcgtcat cgttggagtc 540 
ttcctagcac agatcttcag cctcpaggcc dtcttgggca acccggcagg ttggccggtg 600
cttctggcgc tcacaggggt gcccgccctg ctgcagctgc tgaccctgcc cttcttcccc 660 
gaaagccccc gctaqtccct gattcagaaa ggagatgaag ccacagcgcg acaagctctg 720
aggaggctga gaggccacac ggacatggag gccgagctgg aggacatgcg tgcggaggcc 780 
cgggccgagc ^cgccgag^g ccacct^tct gtgctgcacc tctgtgccct gcggtccctg 840 
cgctggcagc tcctctqc&t catcgtgctc atggccggcc agcagctgtc gggcatcaat 900
gcgatcaact actatgcgga caccatctac acatctgcgg gcgtggaggc cgctcactcc 960
caatatgtaa cggtgggctc tggcgtcgtc aacatagtga tgaccatcac ctcggctgtc 1020
cttgtggagc ggctgggacg gcggcacctc ctgctggiccg gctacggcat ctgcggctct 1080
gcctgcctgg tgctgacggt ggtgctccta ttccaga^ica gggtccccga gctgtcctac 1140
ctcggcatca tctgtgtctt tgcctacatc gcgggacatt ccattgggcd cagtcctgtc 1200
ccctcggtgg tgaggaccga gatcttcctg cagtcctccc ggcgggcagc tttcatggtg 1260
gacggggcag tgcactggct caccaacttc atcataggct tcctgttccc atccatccag 1320
gaggccatcg gtgcctacag tttca«t^atc tttgccggaa tctgcctcct cactgcgatt 1380
tacatctacg tggttattcc ggagaccaag ggcaaaacat ttgtggagat aaaccgqatt 1440
tttgccaaga ga^acagggt gaagcttcca gaggagaaag aagaaaccat tgatgctggg 1500 
cctcccacag cctctcct^c caaggaaact tccttttag 1539


<210> 40 

<211> 3114 

<212> DNA 

<213> Homo sapiens 

<220>

<221> misc_feature

<223> Incyte ID No: 7476747CB1

<400> 40

ccaagcagtg cctcacttct gccttgtcta gctgtactct ggaaaattaa gaaatttatg 60
agtgtagcac caagtatacc aatgggaagg atgggagtca gaagtcaagt gaactcagcc 120
cgcctctgtg tactttgcac ttttccattt cccttggtac caggcacttt catacttaat 180
ccatagtgga gctgtcacag tgagcaactc tgacaatgac agcttctacc ccagaggcga 240
ccccaaacat ggagctaaag gctccagctg caggaggtct taatgctggc cctgtccccc 300
cagctgccat gtccacgcag agacttcgga atgaagacta ccacgactac agctccacgg 360
acgtgagccc tgaggagagc ccgtcggaag gcctcaacaa cctctcctcc ccgggctcct 420
accagcgctt tggtcaaagc aatagcacaa catggttcca gaccttgatc cacctgttaa 480 
aaggcaacat tggcacagga ctcctgggac tccctctggc ggtgaaaaat gcaggcatcg 540 
tgatgggtcc catcagcctg ctgatcatag gcatdgtggc cgtgcactgc atgggtatcc 600 
tggtgaaatg tgctcaccac ttctgccgca ggctgaataa atcctttgtg gattatggtg 660
atactgtgat gtatggacta gaatccagcc cctgctcctg gctccggaac cacgcacact 720 
ggggaagacg tgttgtggac ttcttcctga ttgtcaccca gctgggattc tgctgtgtct 780 
attttgtgtt tctggctgac aactttaaac aggtgataga agcggccaat gggaccacca 840 
ataactgcca caacaatgag acggtgattc tgacgcctac catggactcg cgactctaca 900
tgctctcctt cctgcccttc ctggtgctgc tggttttcat caggaacctc cgagccctgt 960 
ccatcttctc cctgttggcc aacatcacca tgctggtcag cttggtcatg atctaccagt 1020
tcattgttca gaggatccca gaccccagcc acctcccctt ggtggcccct tggaagacct 1080
accctctctt ctttggcaca gcgatttttt catttgaagg cattggaatg gttctgcccc 1140
tggaaaacaa aatigaaggat cctcggaagt tcccactcat cctgtacctg ggcatggtca 1200
tcgtcaccat cctctacatc agcctggggt gtctggggta cctgcaattt ggagctaata 1260
tccaaggcag cataaccctc aacctgccca actgctggtt gtaccagtca gttaagctgc 1320 
tgtactccat cgggatcttt ttcacctacg cactccagtt ctacgtcccg gctgagatca 1380 
tcatcccctt ctttgtgtcc cgagcgcccg agccctgtga gttagtggtg gacctgtttg 1440 
tgcgcccagt gctggtctgc ctgacatcac tgtctggcag tgttgacaat ggctggtatg 1500 
gcacggaagc cgatggcacc tcctgcggca gtgcaccatt ggtcttcgtc agttcctcct 1560 
tcct^gctca cccgtggctg agtttcagat gtgagagcca gtgggtgtcc tgtcacagag 1620
atacggtcgt cgtgtggggc ttcgccaggg gcatcttggc catcctcatc ccccgcctgg 1680 
acctggtcat ctccctggtg ggctccgtga gcagcagcgc cctggccctc atcatcccac 1740 
cgctcctgga ggtcaccacc ttctactcag agggcatgag ccccctcacc atctttaagg 1800 
acgccctgat cagcatcctg ggcttcgtgg gctttgtggt ggggacctat gaggctctct 1860 
atgagctgat ccagccaagc aatgctccca tcttcatcak ttccadctgt gccttcatat 1920 
agggatctgg gttcgtctct gcagctgcct acccctgccc catgtgtccc ccgttacctg 1980
tcctcagagc ctcaggtatg gtccaggctc tgaggaaagt cagggttgct gtgtgggaac 2040
ccctctgcct ggcacctgga taccctgggc caggtaacct gagggcagtg gagaggtggg 2100 
gtggcagaca cgcagaagtg ctactagtga cagggctgcc atcgctcacc tgtacctatt 2160 
tacacccaga actttccagc tccccctcat catgcctcct ccttcctacc tgcctcccct 2220 
ctgctggtgc acctcgccca actcattctt actgcacagt tcactttatt taacaatttt 2280 
catgtccccc acctcatgtt ttcacctttt ctgggccagg caitagattaa gtaactggga 2340 
acgccccctc tttataaagp tgggcttctt tctcatctct ctccc^aatg ttgtatctca 2400 
gtattcttcc tattcgagtc tccagggggt ggctggacct acctggtcat ttgaaacagg 2460 
cccccaagct ggagttttta atctggactc tctggcttgc tgtgacccct aaggcaatgc 2520 
ttctcttccc tggattcctt agtgtgggtc acagtactgt gttcttagtt gctttagctc 2580 
ttaaaacata cgaagtgttg cctaaactga aaatatttat cttttattta aaatcagatt 2640 
tttgttttta gactgtctta gatctggggc tattacgaat cacttcttct tcagtaaact 2700
ttgactcaac ttctcctgct gaaaagaagc tcgctccaga tgtctgcatg ggtcctcggc 2760
actcttggct gaggactcaa aggttttaat caggatcgtc taaaaatgta cctcggtgag 2820
gaggcacaga ttttgcctcc tgttgaccag cctggfcttca taccgaaaag acattgaagg 2880 
actgcagaaa tgtatgggtg caccgggccg agggaagggt ggctgagtga gaggcgtata 2940
a^atggggct gtgtgcatgc aggcccatgt ttcagcctca gcccacgcca ggtgaaagga 3000
tcagcaatgc tctgttgcca tcgtgctggg acgacaccag ctctattgcc accgatgagt 3060 
agctgaggtc agtgtgcaca gagtttgaaa ttaagttaat agactttaca gcag 3114

<210> 41

<211> 2877 
<212> DNA 

<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 7477898CB1 

<400> 41 

atgccggtcc gcaggggcca cgtcgctccc caaaacactt acctggacac catcatccgc 60

aagttcgagg gccaaagtcg gaagttcctg attgccaatg ctcagatgga gaactgcgcc 120 

atcatttact gcaacgacgg cttctgcgaa ctcttcggct actcccgagt ggaggtgatg 180 

cagcaaccct gcacctgcga cttcctcaca ggccccaaca caccaagcag cgccgtgtcc 240

cgcctagcgc aggccctgct gggggctgag gagtgcaagg tggacatcct ctactaccgc 300 

aaggatgcct ccagcttccg ctgcctggta gatgtggtgc ccgtgaagaa cgaggacggg 360 

gctgtcatca tgttcattct caacttcgag gacctggccc agctcctggc caagtgcagc 420 

agccgcagct tgtcccagcg cctgttgtcc cagagcttcc tgggctccga gggctctcat 480 

ggcaggccag gcggaccagg gccaggcaca ggcaggggca agtacaggac catcagccag 540 

atcccacagt tcacgctcaa cttcgtggag ttcaacttgg agaagcaccg ctccagctcc 600

accacggaga ttgagatcat cgcgccccat aaggtggtgg agcggacaca gaacgtcact 660 

gagaaggtca cccaggtcct gtccctgggc gcggatgtgc tgccggagta caagctgcag 720

gcgccgcgca tccaccgctg gaccatcctg cactacagcc ccttcaaggc cgtgtgggac 780 

tggctcatcc tgctgctggt catctacacg gctgtcttca cgccctactc agccgccttc 840 

ctgctcagcg atcaggacga atcacggcgt ggggcctgca gctatacctg cagtcccctc 900 

actgtggtgg atctcatcgt ggacatcatg ttcgtcgtgg acatcgtcat caacttccgc 960 

accacctatg tcaacaccaa tgatgaggtg gtcagccacc cccgccgcat cgccgtccac 1020 

tacttcaagg gctggttcct cattgacatg gtggccgcca tccctttcga cctcctgatc 1080 

ttccgcactg gctecgatga gaccacaacc ctgattgggc tattgaagac agcgcggctg 1140

ctgcggctgg tgcgcgtagc acggaagctg gaccgctact ctgagtatgg ggcggctgtg 1200

ctcttcttgc tcatgtgcac ctttccgctc atagcgcact ggctggcctg catctggtac 1260 

gccatcggca atgtggagcg gccctaccta gaacacaaga tcggctggct ggacagcctg 1320 

ggtgtgcagc ttggcaagcg ctacaacggc agcgacccag cctcgggccc ctcggtgcag 1380 

gacaagtatg tcacagccct ctacttcacc ttcagcagcc tcaccagcgt gggcttcggc 1440 

aatgtctcgc ccaacaccaa ctccgagaag gtcttctcca tctgcgtcat gctcatcggc 1500 

tccctgatgt acgccagcat cttcgggaac gtgtccgcga tcatccagcg cctgtactcg 1560 

ggcaccgcgc gctaccacac gcagatgctg cgtgtcaagg agttcatccg cttccaccag 1620 

atccccaacc cactgcgcca gcgcctggag gagtatttcc agcacgcctg gtcctacacc 1680 

aatggcattg acatgaacgc ggtgctgaag ggcttccccg agtgcctgca ggctgacatc 1740 

tgcctgcacc tgcaccgcgc actgctgcag cactgcccag ctttcagcgg cgccggcaag 1800 
ggctgcctgc
ctggtgcacc
atcctgcgcg 
gtcagcctcc 
gacctgcaca 
gcggagagct 
ctccactcat
gacaaccagt 
gagctactgc 
tgctggcctc 
gagtcccgcg 
cagggccacg 
atagcctcgg 
cagaccccaa 
aggatgcctc 
cagcctgagg 
ctcatctgtg 
ctggacttcc 



gcgcgctagc 
tcggcgacgt 
acgacgtggt 
atgcccagcc 
agatccagcg
tctggagtaa 
ccccccgaca 
cagatgcagc 
aggaaatgcc 
tgaagctggg 
tgtcctcaga 
ccagctacat 
agacgacgag 
gctatggaga 
acctggctgt 
ggctctggcc 
gtccctgctt 
agagacatgg 



cgtcaagttc 
gctctccacc 
cgtggccatc 
aggcaagtcc 
ggcagatctg 
gctggaggtc 
ggctcctggc 
ccctcccctg 
cccaaagcac 
ctccaggcta 
cctcagccgc 
tctggaagcc 
^:ccagggccc 
cttggatgac 
ggcaatggac 
acccctagcc 
ctcctccctc 
ctc^gatcct 



aagaccaccc 
ctctacttca 
ctaggaaaga 
agtgcagacg 
ctggaggtgp 
accttcaacc 
agccaagacc 
agcatctcag 
agcccccaaa 
gagcagctcc 
atcttgcagc 
actgcctcca 
aggctgcccc 
tgtagtc<paa 
aaaactctgg 
tcacctctac 
cctgaacacc 
ggatttgcag 



<210> 42

<211> 2820 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<223> Incyte ID No: 7472728CB1



<400> 42 

atggggcatc 

aggaggaagg 

ggagggtggg 

ccctttagtc 

ccccaggaag 

aacgcttgta 

accttgactc 

tatgcaggat 

catgtgatca 

gatggattta 

gcagcaccgg 

actccaactc 

acctcccaag 

gttgtggtgg 

ga&agcagcc 

tgtgtgagcc 

ggagtcagtc 

tttttctttg 

ggaggcaaaa 

ttggctggaa 

aaggtctttc 

ttcatcttgg 

gagggctgga 

tttggtgatt 

tggtataagc 

ctcagtatga 

gaaatcaagg 

cggcgaaggc 

atggagcgcc 

gagaagcgct 

agcatcaaca 

gggcagggtg 

aagaggaaaa 

accttccgga 

aactcagaca 

gagaacggaa 

gaagacagaa 

tgtgtgtgtt 



aagggccatt 

aggcttggag 

gccatcgtgg 

ttttgggtcc 

gcagactcag 

caaaccatga 

tggatttctg 

taactctgca

cactgacacc 

agggggacag 

tgtgccagcc 

cgcgcctgtc 

srgggcttgca 

tctaccttgt 
agaagaatac 
cccaggagct 
caataggaaa 
ctggaactgt 
tcttttgtat 
ttggagacca 
gaaaaaagca 
ccggctgcat 
cggccttgga 
ttgtggcagt 
ccctagtgtg 
tcggagattg 
cccatgcggc 
tcagcgtgga 
ggcggctggg 
ctgtctttgc 
accggcccaa 
cgtccgagga 
acaaggacct 
attactccct 
actccagcac 
tgatacccac 
actaaatgtg 
tttaatattc 



tgaagaagga 
aagggactgt 
aagacagacc 
tcaaatcaca 
ggtaaattct 
agaactctct 
cctatgggaa 
gtcaaagttt 
tgagagagtg 
gactgaaggc 
caagagcgcc 
catttcctcc 
gaccgtcatg 
cactggcggt 
catcgccttg 
ggagacgttg 
ctcttccaac 
cattacgacc 
tttatatgcc 
acttggaacc 
agtgagtcag 
tgtgtttgtg 
gtccatttac 
ggttgttttc 
gttttggatc 
gctacgggtt 
agagtggaag 
gatccacgat 
cctggaccag 
tgccctggac 
caacctgcgc 
caacatcatc 
caiaaaagacc 
ggacgaggag 
agccatgctg 
ggacaccaaa 
aaggacattg 
acactgagac 



acgcgccgcc 
tctcccgagg 
atgacatctt 
tgcgggctct 
tggac^tg.ta 
tgcgggacgt 
accaaggttt 
atgcattctg 
^ccctcagga 
aggcccagat 
tcctccagaa 
atgacctggc 
agggctttct 
agcac&ggaa 
caccatcctc 
atcccctgga 
ttggctctgt 
ggagttgggg 



tggggacacg 
ctccatcgag 
tggggaaccc 
gacctactgc 
cccggccttt 
aaccgggggt 
ctttctpagt 
gctctggcct 
agacccagat 
gaacaggctg 
gcccatgccc 
cttggttcct 
gcctcctgca 
ctcctccccc 
agaacaggaa 
agtacaagga 
tcccaagcag 
ccactga 



1860 

1920 

1980 

2040 

2100 

2160

2220 

2280 

2340 

2400 

2460 

2520

2580 

2640 

2700 

2760 

2820 

2877 



aatggtggac 
cttttaggag 
tatggtaggg 
gtttatgcag 
gcctgtc ttc 
ttggactgtt 
aaaaccacaa 
ctgttgggca 
acacaccatg 
tgtcgcagtg 
actaacgggc 
cgagccacag 
aagtggaaga 
cttgtcttcc 
gagaaggcgg 
atccagcatg
qiacagcagcc 
atgtatggga 
atctttggaa 
atctttggga 
accaagatcc 
acgatccctg 
tttgtggtgg 
aggggaaacg 
cttgttggcc 
ctgtccaaaa 
gccaatgtca 
aagctgcagc 
cgggcccact 
accggccgct 
ctgaaggggc 
aacaagttcg 
ttgcccgagg 
aagaaagagg 
acggactgta 
gaccgggagc 
gtcttggact 
atgtgcctta 



tgagagtgat 
ccctlgcccag 
cttgtggggt 
tttggcccca 
caccagagag 
tgctttttga 
tagtgccagg 
gagcattgtt 
tacatggctg 
attcagtggc 
aacccccggc 
tggtagccag 
cggtggttgc 
gggcattgga 
aattcctgcg 
ctcttgatgc 
actgggacct 
atattgctcc 
ttccactctt 
aaagcattgc 
gggtcatctc 
ctgtcatctt 
tcactctgac 
ctggcatcaa 
ttgcctactt 
agacaaaaga 
cggctgagtt 
gggcagccac 
cactggacat 
tcaaggcctc 
cggagcagct 
ggtccacctc 
acgttcagaa 
aggagacgga 
tccagcagca 
cggagaacaa 
gagcgttgtg 
aacagacttt 



agcgacctgg 
tgtttcctgt 
gaaagaaaag 
gtcagaggga 
gggactcacc 
gaatgttaac 
ggtgcttcca 
agcaggggtc 
gtatatggag 
cgttcccgca 
tccggctccg 
gatggaaggc 
catctttgtg 
gcagcccttt 
ggatcatgtc 
tgacaatgcg 
cggcagtgcc 
gagcactgaa 
tggtttctta 
aagagtggag 
aaccatcctg 
taagtacatc 
cacggtgggc 
ttatcgggag 
tgcagctgtc 
agaggtgggt 
ccgggagaca 
catccgcagc 
gctgtccccc 
atcccaggag 
gaacaagcat 
cagactcacc 
aatctacaag 
aaagatgtgt 
cgctgagttg 
ctcattactt 
tgtgtgtgtg 
ttagtccaaa 



60 

120 

180 

240 

300 

360 

420

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
attacatagc

caaaaggaat

tgtccgtgca 

tacccctggg 

gcctcctgaa 

gaaggggacc 

ggcatgtgct 

ctatgctcca 

tcagagcaca



attgaagaat 

cagagaacaa

cgtagccggt

cagtgctacc 

agaaacagg^ 

tgattgggtg 

ptgagcttga 

aatgaatgtc 

cgtacgaaaa 



<210> 43 

<211> 1440 

<212> DNA 

<213> Homo sapiens 



atatttcact 
gaacttcatt 
totggqcgta 
tgggcacaca 
gtgtcttttt 
ggag*ctggtt 
attttgatac 
tgtggggacc 
ggtgcaattc 



gtgccataaa 

tcagatagca 

catgttaagg

cgtagacaag 

agtggagtcg 

atgtgtaact 

caacJ:cattca 

tgagagcacc 

cttttctcat 



caactgaaag 
aacgcaggac 
gcatttcagt 
ggcagctatt 
tagtaatatg 
^gcgttggag 
gtgcafccata 
tggaatttgt 
gacaaaaggg 



cttgctctgc 
acaccaagag 
ggcagtgctg 
ccttagacca 
tgcacacaca 
ttgat^tttt' 
cctagtcttt 
tggaagcaga 
aaaaaaataa 



2340 
2400
2460 
2520 
2580 
2640 
2700 
2760 
2820 



<220>

<221> misc_feature

<223> Incyte ID No: 7474322CB1



<220>

<221> unsure
<222> 757
<223> a, t, c, g, or other



<400> 43 

atgtacaatg 

ctcaccaaca 

aatcgccacg 

agattcgtca 

ttcaccatgg 

actggagact 

tttttccgag 

gacagctaca 

ctgtacttca 

tggaccaaca 

atagagaaga 

ttcgggtttt 

tctgagtcca 

agcctgtact 

ttcactgaga 

ctcacctaca 

aacgtctcca 

acggagaaga 

gtggggtaca 

tggaccacct 

gtcaagcgca 

aactttgccc 

cccgaggaag 

ttcaagagtc 



agattctgat 
agaagggaat 
acatgctctt 
agcgcatctt 
ctgcctacta 
atttccgagt 
ggattcagta 
gtgagatgct 
gccacctcaa 
tgctctacta 
tgatcctgag 
ccacagcggt 
cgtcgcacag 
ccacctgcct 
actatgactt 
tagttctcct 
aggagagcga 
gcttccttaa 
cacctgatgg 
ggaacaccaa 
ccctgagctt 
tggtccccct 
tttatctgcg 
ctgccgcttc 



gctgggggcc 
gacgcx:gctg 

ggt'ggagccg 

ctacttcaac 
caggcccgtg 
tactggagag 
tttcctgcag 
tttgtttctg 
ggagtatgtg 
cacccgcggt 
agacctgtgc 
ggtgacgctg 
gtggcggggg 
ggagctgttc 
caaggctgtc 
cctcaacatg 
acgcatctgg 
gtgcatgagg 
caaggacgac 
cgtgggcatp 
ctccctgcgg 
tttaagagag 
acagttttca 
cggggagaag 



aaactgcacc 
gctctggcag 
ctgaaccgac 
ttcctggtct 
gatggcttgc 
atcctgtctg 
aggcggccgt 
cagtcactgt 
gcttccatgg 
ttccagcaga 
cgtttcatgt 
attgaagacg 
cctgctngca 
aagttcacca 
ttcatcatcc 
ctcattgctc 
cgcctgcaga 
aaggccttcc 
taccggtggt 
atcaacgaag 
tcaagcagag 
gcaagtgctc 
gggtctctga 
tgaggacgtc 



cgacgctgaa 
ctgggaccgg 
tcctgcagga 
actgcctgta 
ctccctttaa 
tgttaggagg 
cgatgaagac 
tcatgctggc 
tattctccct 

tergercatcta 

ttgtctacat 
ggaagaatga 
ggcccaatag 
tcggcatggg 
tgctgctggc 
tgatgggcga 
gagccatcac 
gctcaggcaa 
gcttcgtgga 
acccgggcaa 
tttcaggcag 
gagataggca 
agccagagga 
acgcagacag 



gctggaggag 
gaagatcggg 
caagtgg^gac 
catgatcatc 
gatggaaaaa 
agtctacttc 
cctgtttgtg 
caccgtggtg 
ggccttgggc 
tgccgtcatg 
cgtcttcttg 
ctccctgccg 
ctcctacaac 
cgacctggag 
ctatgtaatt 
gactgtggag 
catcctggac 
gctgctgcag 
cgaggtgaac 
ctgtgagggc 
acactggaag 
gtctgctcag 
cgctgaggtt 
cactgtcaac 



<210> 44

<211> 2394

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<223> Incyte ID No: 5455621CB1



60 

120 

180

240 

300 

360

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 



<400> 44 

atttcagaac 

ctcttagcta 

cttctttgaa 

acgagtgcct 

tccactgtac 

ctctctggtg 

gtatttgcag 



acatctgaat 
gggtcagatt 
gtcaactttt 
ctggtaatca 
tgaactttgc 
tcatactttc 
ccctcttttt 



tccttctctg 
tcaaattctc 
gagatcttca 
ttttcttcag 
ttttggaaac 
caatatggag 
catttcctct 



tggcatatgc 
atctcttggt 
ctaagtacac 
ggaatcacag 
atcttcttcc 
gtgaagaact 
ggaattgggg 



tttaggagag 
gccaatacca 
gttggtgtct 
tctctcctct 
tgagacctcg 
ttgcagtttg 
tgttctttgc 



gagcagacag 60 
ccaccagatt 120 
gaagattcac 180 
cagcaaagca 240 
1 1 gaaagaaa 300 
ggattatgtt 360 
cattaaggag 420 
agaaaaaagg caacttcccg agagttcctg gttgggggaa ggcaaatgag ctttggccct 480
gtcggcttgt ctctgacagc cagcttcatg tcagctgtca cggtcctggg gaccccttct 540 
gaagtctacc gctttggggc atccttccta gtcttcttcd ttgcttacct atttgtcatc 600
ctcttaacat cagagctctt tctccctgtg ttctacagat ctggtatcac cagcacttat 660 
gagtacttac aactacgatt caacaaacc^ gttogctat,g ctgccacagt catctacatt 720 
gtacagacga ttctctacac aggagtggtg gtgtatgctc ctgccetggc actcaatcaa 780 
gtgactgggt ttgatctctg gggctctgtg tttgcaacag gaattgtttg cacattctac 840
tgtaccctgg gaggattaaa agcagtggtg tggacagatg catttfcagat ggttgtcatg 900 
attgtgggct tcttaacggt tctcattcaa ggatcaactc atgctcrgggg attccacaat 960 
gtattagagc aatcaacaaa tggatctcga ctacatatat ttgactttga tgtagatcct 1020 
ctcaggcgac acactttttg gactatcaca gtgggaggaa cttttacttg gctcggaatc 1080 
tatggggtca atcaatcaac tattcagcga tgcatqtctt gcaaaacaga aaagcatgct 1140 
aagcttgcct tgtattttaa cttgctgggt ctctggatca ttbtggtgt'g tgctgtcttc 1200 
tctggcttaa tcatgtacto tcactttaaa gactgtg^cc cttggacttc tggcatcatc 1260 
tcagcaccag accagctgat gccgtacttt gtcatggaga tatttgccac aatgccagga 1320 
ctgccaggac tttttgtggc ttgtgccttc agtggaactc tgagcaccgt ggcttccagc 1380
atcaatgcct tggcaacagt gacctttgag gattttgtca agagctgttt tcctcatctc 1440 
tccgacaagc tgagcacctg gatdagtaaa ggcttatgtc tcttatttgg cgtgatgtgt 1500
pcctctatgg ctgtggctgc atctgtcatg ggaggtgttg tgcaggcttc cctcagcatt 1560 
cacggcatgt gtggaggacc aatgctgggc ttattctccc tgggaatcgt gttccctttt 1620 
gtgaactgga agggtgcact aggaggtctt cttactggaa tcaccttgtc attttgggtg 1680 
gccattgggg ccttcattta ccctgcacca gcctctaaga catggccttt gcctctatca 1740 
acagaccaat gtatcaaatc aaatgtgaca gcaacagggc ctccagtact atccagcaga 1800
cgtggaatag ctgatacctg gtactcgatc tcctaccttt actacagtgc actgggctgc 1860
ttaggatgca ttgttgctgg agtaatcatc agcctcataa caggtcgcca aagaggtgag 1920 
gatattcaac cactgttaat tagaccagtt tgtaatttat tttgcttttg gtctaagaag 1980 
tacaaaacac tatgctggtg tggagttcag catgacagtg ggacag^gca ggaaaacctt 2040 
gagaatggca gtgcccggaa acagggggct gaatctgtct tacagaacgg actcagaaga 2100 
gaaagcctgg tacatgttcc aggctatgat cctaaggaca aaagctacaa caatatggca 2160 
tttgagacta cccatttcta aggcaatacc tgtatgaatg cacacacaca cgtgcaatac 2220 
acacacacac acacaaactc cacatacttc ttgcctactt gttagtagat atgtatagtt 2280 
gccattgcta gaagacaggg atgtctggtg cctatttcta cttatttata actacatgca 2340 
aaatgactgt ctctcgggat attctttgaa agactccaac tttcacagag aaaa 2394 

<210> 45 
<211> 2890 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7477248CB1

<400> 45 

gaatactaiag ccagggcaga atgcttgtga agtagcaact aaagtggcag tgtttcttct 60 
gaaattctca ggcagtcaga ctgtcttagg caaatcttga taaaatagcc cttatccagg 120 
tttttatcta aggaatccca agaagactgg ggaatggaga gacagtcaag ggttatgtca 180 
gaaaaggatg agtatcagtt tcaacatcag ggageggtgg agctgcttgt cttcaatttt 240 
ttgctcatcc ttaccatttt gacaatctgg ttatttaaaa atcatcgatt ccgcttcttg 300 
catgaaactg gaggagcaat ggtgtatggc cttataatgg gactaatttt acgatatgct 360
acagcacdaa ctgatattga aagtggaact gtctatgact gtgtaaaact aactttcagt 420 
ccatcaactc tgctggttaa tatcactgac caagtttatg aatataaata caaa^gagaa 480 
ataagtcagc acaacatcaa tcctcatcaa ggaaatgcta tacttgaaaa gatgacattt 540 
gatccagaaa tcttcttcaa tgttttactg ccaccaatta tatttcatgc aggatatagt 600 
ctaaagaaga gacacttttt tcaaaactta ggatctattt taacgtatgc cttcttggga 660 
actgccatct cctgcatcgt catagggtta attatgtatg gttttgtgaa ggctatgata 720 
catgctggcc agctgaaaaa tggagacttt catttcactg actgtttatt ttttggttca 780 
ctgatgtctg ctacagatcc agtgacagtg ctggccattt tccatgaact gcacgtcgac 840 
cctgacctgt acacactctt gtttggagag agtgtgttga atgatgcagt ggccatagtc 900 
cttacatatt ctatatccat ttacagtccc aaggagaatc caaatgcatt tgatgccgca 960 
gcattcttcc agtctgtggg gaatttcctg ggaatcttcg ctggctcatt tgcaatgggg 1020 
tctgcgtatg ccatcatcac agcactgttg accaaattta ccaagctgtg tgagttcccg 1080 
atgctggaaa ccggcctgtt tttcctgctt tcttggagtg ccttcctgtc tgccgaggct 1140 
gccggcctaa cagggatagt tgctgttctc ttctgtggag tcacacaagc acattatacc 1200 
tacaacaatc tgtcttcgga ttccaaaata agaactaaac agttgtttga atttatgaac 1260 
tttttggcgg agaacgtcat cttctgttac atgggcctgg cactgttcac gttccagaat 1320 
catatcttta atgctctttt tatacttgga gcctttctag caatttttgt tgccagagcc 1380 
tgcaacatat atcccctctc cttcctcctg aatctaggcc gaaaacagaa gatcccctgg 1440
aactttcagc acatgatgat gttttcaggt ttgcgaggag cgatcgcatt tgccttagct 1500
attcggaaca cagaatct<pa gdfccaaacaa atgatgtt.ta ccactacgct gctccfecgtg 1560 
ttcttcactg tctgggtatt tggaggagga acaaccccca tgttgacttg gcttcagatc 1620 
agagttggcg tggacc.tgga tgacjaatctg aaggaggacc cctccfccaca acacc^ggaa. 1680 
gcaaataact tggataaaaa catgacgaaa gcagagagtg ctcggctctt cagalatgtgg 1740 
tatagctttg accacaagta tctgaaacca attttaaccc actetggtcc tccgctgact 1800 
acaacattac ctgaatggtg tggtccgatt tccaggctgc ttaccagtcc tcaagcctat 1860 
ggggaacagc taaaagagga tgatgtggaa tgcatt'gtaa aiccaggatga actagccata 1920 
aattaccagg aigcaagcctc ctcaccctgc agtcctcctg caaggctagg tctggaccag 1980 
aaagcttcac cccagacgcc aggcaaggaa aacatttatg agggagacct cggccctggg 2040 
aggctatgaa ctcaagcttg agcaaacttt gggtcaatcc cagttgaatt aattggcatg 2100
aagagtacag atgtaatcac ^agtaatgca agactcactg aggaatacaa gccaagctga 2160 
tgaggcagta caggggagag gctggaaaac dtattaa^ag cataaattgg agagaatcaa 2220
agccttgtca catggatcct ctggtgcctg aagaaatgag attttattat ccctctctat 2280
tatgcaaatg aatttagttt tttgacagca gccattctga ttactggatt ggctggggtg 2340
gggatggagg tatcaggagrt ctagctgctg gaggatggga cagctgtgqt gggtcttcag 2400 
ggcatttctg btgcgaatgc ggctctccag gcccttcact tctattctgg attttattcc 2460
ctccattaag gagagtttka aaataaaaga aagcttctga gagtaaacat tttgctccta 2520 
agctgaaggg aatgcccagc tatttagtaa gtgataagtt tcttattttg aggacttgac 2580 
tcceatttgc tctcagtgac cccagggcag agcccagaga agtgttccgt acccactgct 2640
gatggtttcc cagagcccac actgagttga agaacctatt gttcttcttg gcatccttct 2700 
tatgctactt ctcccatcgc tcaaaggggt tgcctatggc tgggtgrtgcc ctgccctaaa 2760 
tgcagcacca ctttcaagca gcttctagct atagctttcc accaggtatt tttaatccca 2820 
tttcacctcc tcccccagca attcaccagt caggagtgat ttttactgta aagatggttg 2880
cttagtaaaa 2890 

<210> 46

<211> 3926

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2944004CB1

<400> 46

ctggaccttt aatccactgt aggtatggac agggaagaaa ggaagaccat caatcagggt 60 
caagaagatg aaatggagat ttatggttac aatttgagtc gctggaagct tgccatagtt 120 
tctttaggag tgatttgctc tggtggggtt tctcctcctc ctctctattg gatgcctgag 180 
tggcgggtga aagcgacctg tgtcagagct gcaattaaag actgtgaagt agtgctgctg 240 
aggactactg atgaattcaa aatgtggttt tgtgcaaaaa ttcgcgttct ttctttggaa 300
acttacccag tttcaagtcc aaaatctatg tctaataagc tttcaaatgg ccatgcagtt 360
tgtttaattg agaatcccac tgaagaaaat aggcacagga tcagtaaata ttcacagact 420 
gaatcacaac agattcgtta tttcacccac catagtgtaa aatatttctg gaatgatacc 480 
attcacaatt ttgatttctt aaagggactg gatgaaggtg tttcttgtac gtcaatttat 540 
gaaaagcata gtgcaggact gacaaagggg atgcatgcct acagaaaact gctttatgga 600 
gtaaatgaaa ttgctgtaaa agtgccttct gtttttaagc ttctaattaa agaggttctc 660 
aacccatttt acattttcca gctgttcagt gttatactgt ggagcactga tgaatactat 720 
tactatgctc tagctattgt ggttatgtcc atagtatcaa tcgtaagctc actatattcc 780 
attagaaagc aatatgttat gttgcatgac atggtggcaa ctcatagtac cgtaagagtt 840 
tcagtttgta gagtaaatga agaaatagaa gaaatctttt ctaccgacct tgtgccagga 900
gatgtcatgg tcattccatt aaatgggaca ataatgcctt gtgatgctgt gcttattaat 960 
ggtacctgca ttgtaaacga aagcatgtta acaggagaaa gtgttccagt gacaaagact 1020 
aatttgccaa atccttcagt ggatgtgaaa ggaataggag atgaattata taatccagaa 1080 
acacataaac gacatacttt gttttgtggg acaactgtta ttcagactcg tttctacact 1140 
ggagaactcg tcaaagccat agttgttaga acaggattta gtacttccaa aggacagctt 1200 
gttcgttcca tattgtatcc caaaccaact gattttaaac tctacagaga tgcctacttg 1260 
tttctactat gtcttgtggc agttgctggc attgggttta tctacactat tattaatagc 1320 
attttaaatg aggtacaagt tggggtcata attatcgagt ctcttgatat tatcacaatt 1380
actgtgcccc ctgcacttcc tgctgcaatg actgctggta ttgtgtatgc tcagagaaga 1440 
ctgaaaaaaa tcggtatttt ctgtatcagt cctcaaagaa taaatatttg tggacagctc 1500 
aatcttgttt gctttgacaa gactggaact ctaactgaag atggtttaga tctttggggg 1560 
attcaacgag tggaaaatgc acgatttctt tcaccagaag aaaatgtgtg caatgagatg 1620 
ttggtaaaat cccagtttgt tgcttgtatg gctacttgtc attcacttac aaaaattgaa 1680 
ggagtgctct ctggtgatcc acttgatctg ciaaatgtttg aggctattgg atggattctg 1740 
gaagaagcaa ctgaagaaga aacagcactt cataatcgaa ttatgcccac agtggttcgt 1800 
cctcccaaac aactgcttcc tgaatctacc cctgcaggaa accaagaaat ggagctgttt 1860
gaacttccag ctacttatga gataggaatt gttcgccagt tcccattttc ttctgctttg 1920
caacgtatga gtgtggttgc c^gggtgctg ggggatagga aaatggacgc!: ctacafega'aa 1980 
ggagcgcccg aggccattgc cggtctctgt aaacctgaaa cagttcctgt cgattttcaa 2040
aacgttttgg aagacttcac taa^cagggc ttccgtgtga ttgctcttgc acacagaaaa 2100 
ttggagtcaa aiactgacatg gcataaagta cagaatatta gcagagatgc aattigagaac 2160 
aacatggatt ttatgggatt aattataatg cagaacaaat taaagcaaaa aacccctgca 2220
gtacttgaag atttgcataa agccaacatt cgcaccgtca tggtcacagg tgacagtatg 2280 
ttgactgctg tctctgtggc cagagattgt ggaatgattc tacctcagga taaagtgatt 2340
attgctgaag cattacctce aaaggatggg aaagttgcca aaataaattg gcattatgca 2400 
gactccctca cgcagtgcag tcatccatca gcaattgacc cagaggctat tccggttaaa 2460
ttggtccatg atagcttaga ggatcttcaa atgactcgtt atcattttgc aatgaatgga 2520
aaatcattct cagtgatact ggagcatttt caagaccttg ttcctaagtt gatgttgcat 2580 
ggcaccgtgt ttgcccgtat ggcaicctgat dagaagacac agttgataga agcattgeaa 2640
aatgttgatt attttgttgg gatgtgtggt gatggcgcaa atgattgtgg tgctttgaag 2700 
agggcacacg gaggcatttc cttatcggag ctcgaagctt cagtggcatc tccctttacc 2760
tctaagactc ctagtatttc ctgtgtgcca aaccttatca gggaaggcqg tgctgcttta 2820
ataacttcGt 'tctgtgtgtt taaattcatg gcattgtaca gcattatcca gtacttcagt 2880 
gttactctgc tgtattctkt cttaagtaac ctaggagact tccagtttct cttcattgat 2940 
ctggcaatca ttttggtagt ggtatttaca atgagtttaa atcctgcctg gaaagaactt 3000 
gtggcacaaa gaccaccttc gggtcttata tctggggccc ttctcttctc cgttttgtct 3060 
cagattatca tctgcattgg atttcaatct ttgggttttt tttgggtcaa acagcaacct 3120
tggtatgaag tgtggcatcc aaaatcagat gcttgtaata caacaggaag cgggttttgg 3180 
aattcttcalc acgtagacaa tgaaaccgaa cttgatgaac ataatatacA aaattatgaa 3240 
aataccacag tgttttttat ttccagtttt cagtacctca tagtggcaat tgccttttca 3300 
aaaggaaaac ccttcaggca accttgctac aaaaattatt tttttgtttt ttctgtgatt 3360
tttttatata tttttatatt att;ca(tbatg ttgtatecag ttgcctctgt tgaccaggtt 3420 
cttcagatag tgtgtgtacc atatcagtgg cgtgtaacta tgctcattat tgttctt;gtc 3480 
aatgcctttg tgtctatcac agtggagaac ttcttccttg acatggtcct ttggaaagtt 3540
gtgttcaacc gagacaaaca aggagagtat cggttcagca ccacacagcc accgcaggag 3600
tcagtggatc ggtggggaaa atgctgctta ccctgggccc tgggctgtag aaagaagaca 3660
ccaaaggcaa agtacatgta tctggcgcag gagctcttgg ttgatccaga atggccacca 3720 
aaacctcaga caaccacaga agctaaagct ttagttaagg agaatggatc atgtcaaatc 3780
atcaccataa catagcagtg aatcagtctc agtggtattg ctgatagcag tattcaggaa 3840
tatgtgattt taggagtttc tgatcctgtg tgtcagaatg gcactagttc agtttatgtc 3900
ccttctgata tagtagctta tttgac 3926

<210> 47

<211> 2135

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 3046849CB1



atgctccacg ccctcctgcg atctagaacg attcagggca 60 
tgcgctgccg gcattggtgg gacttttcag tttggctata 120 
ccgaccttgc acattcagga attcaccaat gagacatggc 180 
ctgcccgatc accthgtcct gcttatgtgg tccctcatcg 240 
ggcctctttg gagcactgct tgcaggtccc ttggccatca 300
ctcctggtga ataacatctt tgtggtgtca gcagcaatcc 360
gcaggctcct ttgagatgat catgctggga agactgctcg 420 
agcatgaaca tccagcccat gtacctgggg gagagcgccc 480 
gtggccatga gctcagccat ctttacggct ctggggatcg 540
ctcagggagc tcctaggtgg ccctcaggcc tggcccctgc 600
cccggggcgc tccagctcgc ctccctgcct ctgctccctg 660 
attgactgtg gagacaccga ggcctgcctg gcagcactac 720 
gacttggcag gggagctgga ggagctggag gaggagcgcg 780 
gcccggcgcc catgggagct gttccagcat cgggccctga 840 
gtggttctgg gcagtgccat ggagctctgc gggaatgact 900 
tccgtgttcc ggaaggcagg agtgccggaa gcgaagatcc 960 
gggagctgcg agctgctcac ggcggttgtt agttgtgtgg 1020 
cgcgtgctgc tcatcggtgg gtacagcctg atgacctgct 1080 
gccctgtgcc tgcagagctc cttcccctgg acactctacc 1140 
gccttcatcc tcagctttgg cattggccct gccggagtga 1200 
<400> 47

cgctcaggcc 

ggatcctgct 

acctctctat 

agcfcgcgtac 

tgtctctgta 

cgctgggaag 

tgtttggatt 

tgggagtcaa 

ctaaggagct 

tgatgggaca 

tgctggccag 

aaagcccgcg 

ggcagctacg 

ctgcctgcca 

ggagaeaggt 

cggtgtacgc 

agtacgcgat 

taatcgagag 

gggggagcat 

tggccatggc 



cctctttcga 
cctgaccatc 
catcaatgcc 
tggagagcca 
tcccctggga 
gaagaagtcc 
cagccgcaaa 
tgcaggtgtg 
ccgaggagct 

ggtggtcgga 

ctgcctggtg 
ctacctcctc 
gggctccggg 
gggctgccgt 
gacaagcctc 
ctacgcctcc 
catcgggact 
ggtgggtcgg 
cttcactgtg 
ctgcatcttt 





WO 02/04520

PCT/US01/21448



cggg^atcct 
gcggggcgct 
aggccttgtc 
acactggcct 
tacacagact 
tccagtcaac 
gtcctctgct 
gtgtttattg 
agcgtggtat 
aggctggttg 
atcaattgag 
caaaaaattt 
gaggctgagg 
ttcacaccac 
taataataat 
tgaggtggtg 



ggccacagag 
catgtggatc 
ccacttcctc 
gttccttcct 
caacttcccc 
agaactctag 
tcctgccagg 
agcacccttt 
tccaggctaa 
tggcagctca 
gccagagttt 
taaacattag 
cagcaggatc 
tgcatgccca 
aattacggac 
atcccgtatc 



ctgtttgacc 
atgctcatcc 
tatgtccctt 
gagaccaaag 
aggcgggccc 
tcccaaaggg 
gccctggtcc 
gtgtgcagac 
aggtaattaa 
tgactgtaat 
gaaaccagcc 
ctgggcatgg 
acttgagtcc 
gactggggtg 
tccggaaatg 
ccagccttgg 



agatggccag 
tggtcggcct 
tccttggtgt 
gcaagacctt 
9-gggccccac 
gtggccagag 
tcactccctc 
atggctccag 
ctgacagaaa 
cccagcactt 
taggtaacat 
t^gtatgtgc 
caagagttca 
acagaggcpag 
cgttgacaac 
gaagc 



gcctgctgcc
gggatttccc 
ctgtgtctgt 
ccaagagatc 
gtggagga^c 
ccaaag'ccag 
ctgcattcct 
gtgcttagca 
atcagtaaca 
tgggaggcca 
agtgagaccc 
taacagctct 
agigtagca^t 
acttc^tctc 
gaaacatacc 



tgcatggtct 

tttatcatgg 

ggggccatct 

tccaaggaat 

ctggaggtta 

ctactgtcct 

catttaagga

citcaatggtg 

acataattac 

aggtgggagg 

cctatctcta 

agctactcag 

aagctaacaa 

tttaaaaaca 

ggtggccccg 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2135 



<210> 48 

<211> 2637 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature
<223> Incyte ID No: 4538363CB1



<400> 48 

atgggctgga 

ccctttctga 

acagccatgg 

gctgacatca 

tcttgtcggg 

tggtggccga 

ctggcgggct 

tacgtgctgc 

accttacctg 

gtcctgtccc 

ctgtttgtgc 

atcacagccc 

cagacgctca 

ggtggttacg 

aacaccacct 

ggggacctgc 

tgcaccgacc 

gcgggctcca 

ggcatgatca 

ctgcgggcct 

gaactgatgc 

tcgtcgctga 

cggctgcgtc

gcactcatcg 

ctcttcatct 

gtcctgggcg 

gggctggtgg 

ggagagccag 

gccctctttg 

cagagtgtcc 

actaaagcag 

ttcaatgcca 

ctgccatcct 

ggggatcccg 

ccaagaactg 

gaaacataag 

gcgacaaact 

ctaccaacac 

cagaaaggca 

atgccaaaac 



gatgccactg 
cctggtttgc 
ccgccaactc 
tcgtcatcac 
ccagtaggaa 
ttggagcctc 
caggcgcggc 
tggcactggc 
agtacattca 
tgctactgtc 
acatctgcct 
tgtacaccat 
tcatggtggt 
ggcagctgga 
gccacctgcc 
cgtggaccgg 
aggtcatcgt 
tcctggccag 
gccgcgcatt 
gcggggccga 
ccatcggtct 
cctccatctt 
cccgctccgg 
gcgtgagtgt 
acatgcagtc 
tcttctggcg 
tgggggccac 
acacgcggcc 
cactcagtgg 
agattgagaa 
gtgatggcca 
tcctcctcat 
ggacagaaag 
aggccccaag 
gccaagccag 
tgtgactttt 
tagattttcc 
agctcatcac 
cccccccact 
gacacaagcc 



tccgcttggt 
ccctcjagtcc 
caccagcgac 
tgtgtatttt 
cacggtgaat 
cctcttcgcc 
aggaggtctg 
atgggtgttc 
gaagcgctac 
tgtcttcacc 
gggctggaac 
cgcagggggc 

gggggctgtc 

ggcagcctac 
acgtacagac 
gatgaccttt 
gcagcgatca 
ctacctcaag 
gttcccagat 
ggtcggctgc 
gcgggggctg 
caacagcagc 
cgagcgggag 
ggcctggatc 
agtgaccagc 
acgtgccaac 
gaggctggtc 
agccgtcctg 
tgctgttgtg 
ccttacctgg 
aacaccccag 
gtgtgtcaac 
gcaggagctc 

aggggcagat 

caaagcggga 
tccaaacaac 
tgaccgggcc 
catactcaca 
ctcaagacgc 
aggacacggc 



ttaatg^tca 
ctcgggctca 
ctccacactc 
gctctgaacg 
ggctacttcc 
agcagcgagg 
gccgtggcag 
gtgcccatct 

gggggccagc 

aagatatcgc 
ttctacctct 
ctggctgctg 
atcctgacaa 
gcccaggcca 
gccatgcaca 
ggcctgacca 
ctgtcagccc 
atgctcccca 
gatgtgggct 
tccaacatcg 
atgatcgcag 
agcaccctct 
ctcctgctgg 
cccgtcctgc 
tccctggccc 
gagcaggggg 
ctggaattcc 
gggagcatcc 
gtggctggaa 
tggaccctgg 
aaacacgcct 
atattctttt 
tgagtcctca 
tcccctcaca 
gcctgaaaac 
agcacccaaa 
caccacaccc 
ccacccacag 
gacggcgcaa 
actgtataca 



atgagctccc 
tacctagtgc 
ccgggacgca 
tggcpgtggg 
tggcaggccg 
gctctggcct 
gcttcgagtg 
acatctcctc 
ggatccgcat 
tggacctgta 
ccaccatcct 
taatctacac 
tcaaagcttt 
ttccctccag 
tgtttcgaga 
tcatggccac 
gggacctgaa 
tgggcctgat 
gcgtggtgcc 
cctaccccaa 
tgatgctggc 
tcactatgga 
tgggacggct 
aggactccaa 
caccagtgac 
ccttctgggg 
tgaacccagc 
actacctgca 
gcctgctgac 
ctcaggatgt 
tctgggcccg 
atgcctactt 
ggtccaccca 
gctgcacagc 
^ttagggggg 
gcaagtcaag 
caacctcctc 
cggcccgccc 
tcgactgcaa 
gcacgagggt 



tgccaggaaa 

ctgcggcagg 

gctgagcgtg 

catatggtcc 

ggacatgacg 

cttcattgga 

gaatgccacg 

agagatcgtc 

gtacctgtct 

cgcgggggct 

cacgctcggc 

ggacgccctg 

tgacc agate 

gaccattgcc 

cccccacaca 

ctggtactgg 

ccatgccaag 

catcatgccg 

gtccgagtgc 

gctggtcatg 

ggcgctcatg 

catctggagg 

ggtcatagtg

cagcgggcaa 

tgcagtcttt 

cctgatagca 

cccaccgtgc 

cttcgctgtc 

cccaccccca 

gcccttggga 

tgtctgtggc 

cgcctgaaca 

tttccctcat 

agctcggtgc 

aaactgggac 

catttggaac 

acctcccaaa 

ccactccaat 

ctccataacg 

gatctgcaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

gttgtggccg aatgcagaaa atacactggg tgctggcgta aggaagatcc gcgagtaaac 2460 
aacggtcttg taaacttact gcatccacca aggtacactt ccagaacgag accagacaac 2520
tacactccac acaacctgca gccacaccct atttctgctd tcataaagag cccccgcacc 2580 
acataataat gccggcagac tcagtgcgeg aaacccttgt gctggacttc accacgg 2637 

<210> 49

<211> 2l783 

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 6427460CB1



<400> 49 

gcactagtac 

agcgggagcg 

ctgtgcagga 

pcctccgggc 

caagtacaca 

caacttttat 

agtgacaagc 

tgaagactgg 

tcagcacggc 

ggttaaggag 

tgggacgtgc 

cgcggtccag 

catcgagtgt 

cagtgacctg 

agctacactg 

caagatggca 

gaatgcgttc 

gctgaaatac 

ggagtcggaa 

ggtcctcttt 

cctcggctct 

gcctctggtg 

agacaagacc 

ccatgtctac 

cgacatgatt 

ggccc tctgt 

gaaatcgccg 

gctggtcgaa 

gg^gatatta 

tgactcagtc 

gttttgcaaa 

gatccgagcc 

taaaaggctg

ggcccttcaa 

tactctgctt 

cgaggccctg 

ggccgcggcc 
gaccaccaag 
ggtcctgcgc 
gcaggactac 
agacgggagt 
ggtgctctgc 
ttcaaaagag 
tctggaagcg 
cagcgactat 
tttttattac 
catcttccct 
caccgcgtat 
cctcatggag 
cgtcgccaag 
gtttgacgca 
aagcaacggg 



cccggagccc 
gdcggaggag 
gaagagaatt 
gcagaggcct 
ttttggaact 
ttcottatca 
ggacttccac 
cttcgacata 
aagctcgttc 
gacgagacct 
cacgtcacca 
gacaccaaag 
gagcagcccc 
aa.tgaccccg 
aagaacactg 
ttaaattatc 
ctcattgtgt 
gtgtggcaga 
aggcagagga 
aactacatca 
tacttcatca 
aacacgtcgg 
ggcaccctca 
gtgccccacg 
gactcgtccc 
ctctgccaca 
gacgggggga 
ggtgtccaga 
aacagggaga 
agaaggagaa 
ggagcagatt 
agagtggagc 
atccaagaag 
gatcgagaga 
ggtgctacag 
cagaaggccg 
acgtgctacg 
aggatcgagg 
cacagcggga 
ggtttaatta 
tccggcaact 
tgccgcatgg 
cacccaatca 
cacgtgggca 
gcaatcccaa 
attaggatct 
cagtttttat 
ctgaccctct 
cagcatgttg 
aatgccctgc 
ctggtgttct 
cagatatttg 



atgggcgcgc 
ccatggactg 
gggtlggacag 
acatcccaca 
ttataoccaa 
tatttctggt 
tcttctttgt 
aageagacaa 
ggaaacaaag 
ttccctgcga 
ccgccagctt 
gcttccacac 
agcccgacct 
tggtgaggcc 
agaaaatctt 
aatcaaaatc 
atctctgcat 
gtgagccctt 
atctgttcct 
tccctgtgtc 
cctgggacga 
acctcaatga 
cggaaaacaa 
tcatctgcaa 
ccagcgtcaa 
ccgtccaggt 
aatcctgtgt 
gacttggctt 
accacatcga 
tgagtgtaat 
cttcgatatt 
gtaacgcagt 
aatatgaagg 
aaaagttagc 
ctgttgagga 
ggatcaaagt 
cctgcaagct 
agcagagcct 
gcctgaccag 
tcgacggagc 
acagggagct 
cgcccttgca 
cgttagcaat 
taggtgtcat 
agtttaagca 
ctgagctcgt 
accagttctt 
acaacatcag 
gcattgacgt 
tgcgctggcg 
tctttggtgc 
gaaactggac 



cgagccgggc 

cagcctcgtg 

caggaccatc.. 

gagataccca 

gaatttattt 

gcagttgatt 

cat tact gtg 

tgccatgaac 

tcgaaagctg 

cttgatcttc 

ggatggagaa 

agaggaggat 

ctacaagttc 

cttaggatcg 

tggtgtggct 

tcagaagcga. 

tctgatcagc 

tcgggatgag 

caaggcattc 

catgtacgtc 

agacatgttt 

agagctggga 

catggagttc 

cgggcsaggtc 

cgggagggag 

gaaagacgat 

gtacatctca 

tacctaccta 

aaggtttgaa 

tgtaaaatct 

cccccgagtg 

ggaggggctc 

catttgtaag 

agaagcctat 

ccggctgcag 

ctgggttctc 

cttccgcagg 

gcacgacgtc 

agacaacctc 

tgcactgtct 

cttcctggaa 

gaaggctcag 

tggcgatggt 

cggcaaggaa 

tttgaagaag 

gcagtacttc 

ctgtgggttt 

cttcacctcc 

gctcaagaga 

cgtgttcatc 

ttatttcgtg 

gtttggaacg 



gcgggggcgc 
cggacgctcg 
tacgtgggac 
gacaacagga 
gaacaattca 
attgatacac 
acggctatca 
cagtgtcctg 
cgagttgggg 
ctttccagca 
tccagccata 
atcggcggac 
gtgggtcgca 
gaaaacctgc 
atttacacgg 
tctgccgtgg 
aaagccctga 
ccgtggtata 
acggacttcc 
acggtcgaga 
gacgaggaga 
caggtggagt 
aaggagtgct 
ctcccagagt 
cgcgaggagc 
gacagcgtag 
tectcgcccg 
aggctgaagg 
ttgctggaaa 
gctacaggag 
atagaaggca 
cgaactttgt 
ctgctgcagg 
gagcaaatag 
gagaaagctg 
acgggagaca 
aacacgcagc 
ctgttcgagc 
tccggacttt 
ctgataatga 
atctgccgga 
attgttaaat 
gcaaatgatg 
ggccgccagg 
atgctgcttg 
ttctataaga 
tcacaacaga 
ctccccatcc 
gacccgaccc 
tactggacgc 
tttgaaaata 
ctggtattca 



tgaacggcgg. 

tgcacagata. 

acagggagcc 

tcgtctcgtc . 

gaagagtagc 

ccacaagtcc 

aacagggtta 

ttcatttcat 

acattgtcat 

accggggaga 

aaacgcatta 

ttcacgccac 

tcaacgttta 

tgcttagagg 

gaatggaaac 

aaaaatcgat 

taaacactgt 

atcagaaaac 

tggccttcat 

tgcagaagtt 

ctggcgaggg 

acatcttcac 

gcatcgaagg 

cgtcaggaat 

tgtttttccg 

acggccccag 

acgaggtggc 

acaattacat 

ttttgagttt 

aaatttatct 

aagttgacca 

gtgttgctta 

ctgccaaagt 

agaaagatct 

cagacaccat 

agatggagac 

tgctggagct 

tgagcaagac 

cagcagatat 

agcctcgaga 

gctgcagcgc 

taatcaaatt 

tcagcatgat 

ctgccaggaa 

ttcacgggca 

acgtctgctt 

ctttgtacga 

tcctgtacag 

tgtacaggga 

tcctgggact 

caactgtgac 

ccgtgatggt 



60 

120 

180 

240 

300 

360 

420 

480 • 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380. 

1440 

1500 

iseo 

1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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BNSDOCID: <WO_0204520A2J_> 



wo 02/04520 



PCTAJSOl/21448 



gttcacagtt acactaaagc ttgcattgga cacacactac tggacttgga bcaaccattt 3180 
tgtcatctgg gggtcgctgc. tgttctacgt tgtcttttca cttctctggg gaggagtgat 3240. 
ctggccgttc ctcaactapc ag^ggatgta ctaagtgt.tc atccagatgt tgtccagcgg 3300 
gcccgcctgg ctggccatcg tgctgctggt gaccatcagc ctccttcccg acgtcctcaa 3360 
gaaagtcctg tg.ccggcagp tgt^gccaac agcaacagag ^gagtccagc agaatgggtg 3420 
cgcacagcct cgggaccgcg actcagaatt cacccctctt gcctctctgc agagcccagg 3480 
ctaccagagc acctgtccct cggccgcctg gtabagctcc cactctcagc aggtgacact 3540 
cgcggcctgg aaggagaagg tgtccacgga gcccccaccc atcctcggcg gttcccatca 3 600 
ccactgcagt tccatcccaa gtcacagctg ccctaggtcc cgtgtgggaa tgctcgtgtg 3 660 
atggatggtc ctaagcctgt ggagactgtg cacgtgcctc ttcctggccc ccagcaggca 3720 
aggagggggg tcacaggcct tgccctcgaa catggcaccc tggccgcctg gacccagcac 3780 

. ' . 378.3 

<210> 50
<211> 2105
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7474127CB1

<400> 50

ccagcgccca gggaagcggc tcaaccacct gaatccggaa aacgccaaca agtagtttct 60
cgtcggagaa gggcggctca cctgggcgcc aagactcagt cccgctgccc agagaacctc 120 
gtccactcgg aaaccaaagc agaaccactt ttctctcggt ctcgttaagt catgtctgag 180 
tcacagagat gggcaagatc gagaslc^acg agagggtgat cctcaatgtc gggggcaccc 240 
ggcacgaaac ctaccgcagc accctcaaga ccctgqctgg aacacgcdtg gcccttcttg 300
cctcctccga gcqcccaggc gactgcttga ccacggcggg cgacaagctg cagccgtcgc 360
cgcctccact gtcgccgccg ccgagagcgc ccccgctgtc ccccgggcca ggcggctgct 420 
tcgagggcgg cgcgggcaac tgcagttccc gcggcggcag ggccagcgac cateccggtg 480 
gcggccgcga gttcttcttc gaccggcacc cgggcgtctt cgcctatgtg ctcaattact 540 
accgcaccgg caagctgcac tgccccgcag acgtgtgcgg gccgctcttc gaggaggagc 600 
tggccttctg gggcatcgac gagaccgacg tggagccctg ctgctggatg acctaccggc 660 
agcaccgcga cgccgaggag gcgctggaca tcttcgagac ccccgacctc attggcggcg 720 
accccggcga cgacgaggac ctggcggcca agaggctggg catcgaggac gcggcggggc 780 
tcgggggccc cgacggcaaa tctggccgct ggaggaggct gcagccccgc atgtgggccc 840 
tcttcgaaga cccctactcg tccagagccg ccaggtttat tgcttttgct tctttattct 900 
tcatcctggt ttcaattaca actttttgcc tggaaacaca tgaagctttc aatattgtta 960 
aaaacaagac agaaccagtc atcaatggca caagtgttgt tctacaatat gaaattgaaa 1020 
cagatcctgc cttgacgtat gtagaaggag tgtgtgtggt gtggtttact tttgaatttt 1080
tagtccgtat tgttttttca cccaataaac ttgaattcat caaaaatctc ttgaatatca 1140 
ttgactttgt ggccatccta cctttctact tagaggtggg actcagtggg ctgtcatcca 1200 
aagctgctaa agatgtgctt ggcttcctca gggtggtaag gtttgtgagg atcctgagaa 1260
ttttcaagct cacccgccat tttgtaggtc tgagggtgct tggacatact cttcgagcta 1320 
gtactaatga atttttgctg ctgataattt tcctggctct aggagttttg atatttgcta 1380 
ccatgatcta ctatgccgag agagtgggag ctcaacctaa cgacccttca gctagtgagc 1440 
acacacagtt caaaaacatt cccattgggt tctggtgggc tgtagtgacc atgactaccc 1500 
tgggttatgg ggatatgtac ccccaaacat ggtcaggcat gctggtggga gccctgtgtg 1560 
ctctggctgg agtgctgaca atagccatgc cagtgcctgt cattgtcaat aattttggaa 1620
tgtactactc cttggcaatg gcaaagcaga aacttccaag gaaaagaaag aagcacatcc 1680 
ctcctgctcc tcaggcaagc tcacctactt tttgcaagac agaattaaat atggcctgca 1740 
atagtacaca gagtgacaca tgtctgggca aagacaateg acttctggaa cataacagat 1800 
cagtgttatc aggtgacgac agtacaggaa gtgagccgcc actatcaccc ccagaaaggc 1860 
tccccatcag acgctctagt accagagaca aaaacagaag aggggaaaca tgtttcctac 1920 
tgacgacagg tgattacacg tgtgcttctg atggagggat caggaaaggt tatgaaaaat 1980 
cccgaagctt aaacaacata gcgggcttgg caggcaatgc tctgaggctc tctccagtaa 2040 
catcacccta caactctcct tgtcctctga ggcgctctcg atctcccatc ccatctatct 2100 
^9^^^ 2105 

<210> 51
<211> 2069
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7476949CB1

<400> 51

atgagcaagg acctggcagc aatggggcct ggagcttcag gggacggggt caggactgag 60
acagctccac acatagcact ggactccaga gttggtctgc ^cgcctacga catca^cgtg 120
gtggtcatct actttgtctt cgtcattgct gtggggatct ggtcgtccat ccgtgcaagt 180
cgagggacca jbtggcggcta tttcctggcc gg^^ggtcca tgagctggtg gccaattgga 240 
gcatctctga tgtccagcaa tgtgggcagt ggcttgttca tcggcctggc tgggacaggg 300
gctgccggag gccttgccgt aggtggcttc gagtggaacg caacctggct gctcctggcc 360
cttggctggg tcttcgtccc tgtgtacatc gcagcaggtg tggtcacaat gccgcagtat 420
ctgaagaagc gatttggggg ccagaggatc caggtgtaca tgtctgtcct gtctctcatc 480
ctctacatct tcaccaagat ctcgactgac atcttctctg gagccctctt catccagatg 540
gcattgggct ggaacctgta fctctccaca gggatcctgq tggtggtgac tgccgtctac 600 
accattgcag gtggcctcat ggccptgatc tacacagatg ctctgcagac ggtgatcatg 660
gtagggggag ccctggtcct catgtttctg ggctttcagg acgtgggctg gtacccaggc 720 
ctggagcagc ggtaqaggca ggccatccct aatgtcacag tccccaacac cacctgtcac 780 
ctcccacggc ccgatgcttt ccacattctt cgggaccctg tgagcgggga catcccttgg 840 
ccaggtctca ttttcgggdt cacagtgctg gccacctggt gttggtgcac agaccaggtc 900 
attgtgcagc ggtctc^ictc ggccaagagt ctgtctcatg ccaagggagg ctccgtgctg 960 
gggggctacc tgaagatcct ccccatgttc ttcatcgtca tgcctggcat gatcagccgg 1020 
gccctgttcc cagacgaggt gggctgcgtg gaccctgatg tctgccaaag aatctgtggg 1080
gcccgagtgg gatgttccaa cattgcctac cctaagttgg tcatggccct catgcctgtt 1140
ggtctgcggg ggctgatgat tgccgtgatc atggccgptc tcatgagctc actcacctcc 1200 
atcttcaaca gcagcagcac cctgttcacc attgatgtgt ggcagcgctt ccgcaggaag 1260 
tcaacagagc aggagctgat ggtggtgggc agagbgtttg tggtgttcct ggttgtcatc 1320 
agcatcctct ggatccccat catccaaagc tccaacagtg ggcagctctt cgactacatc 1380 
caggctgtca ccagttacct ggccocaccc atcaccgctc tcttcctgct ggccatcttc 1440 
tgcaagaggg tcacagagcc cggagctttc tggggpctcg tgtttggcct gggagtgggg 1500
cttctgcgta tg^tcctgga gttctcatap ccagcgccag cctgtgggga ggtggaccgg 1560 
aggccagcag tgctgaag^a cttccactac ctgtactttg caatcctqct ctgcgggctc 1620 
actgccatcg tcattgtcat tgtcagcctc tgtacaactc ccatccctga ggaacagctc 1680 
acacgcctca catggtggac tcggaactgc cccctctctg agctggagaa ggaggcccac 1740 
gagagcacac cggagatatc cgagaggcca gccggggagt gccctgcagg aggtggagcg 1800 
gcagagaact cgagcctggg ccaggagcag cctgaagccc caagcaggtc ctggggaaag 1860 
ttgctctgga gctggttctg tgggctctct ggaacaccgg agcaggccct gagcccagca 1920 
gagaaggctg cgctagaaca gaagctgaca agcattgagg aggagccact ctggagacat 1980
gtctgcaaca tcaatgctgt ccttttgctg gccatcaaca tcttcctctg gggctatttt 2040 
gcgtgattca aacctggctt cactgtaga 2069 

<210> 52
<211> 4245
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7477249CB1

<400> 52

gcggcggcag gctcagctgc gccgggcggg ggcggcgctg gggccgcgcc tgtaggactc 60 
ggggccgacg ccgcgggatg gggacgcggc gcggggagtg aggcagtggc ggcggcggcg 120 
gtaagcggaa cttcggcccg aggggctcgc ccgctcccgc ctctgtcttg tcggcctcca 180 
cctgcagccc cgcggccccc gcgccccgcg ggacccggac ggcgacgacg ggggaatgtg 240 
gcgctggatc cggcagcagc tgggttttga cccaccacat cagagtgaca caagaaccat 300
ctacgtagcc aacaggtttc ctcagaatgg cctttacaca cctcagaaat ttatagataa 360
caggatcatt tcatctaagt acactgtgtg gaattttgtt ccaaaaaatt tatttgaaca 420 
gttcagaaga gtggcaaact tttattttct tattatattt ttggttcagc ttatgattga 480 
tacacctacc agtccagtta ccagtggact tccattattc tttgtgataa cagtaactgc 540 
cataaagcag ggatatgaag attggttacg gcataactca gataatgaag taaatggagc 600 
tcctgtttat gttgttcgaa gtggtggcct tgtaaaaact agatcaaaaa acattcgggt 660 
gggtgatatt gttcgaatag ccaaagatga aatttttcct gcagacttgg tgcttctgtc 720 
ctcagatcga ctggatggtt cctgtcacgt tacaactgct agtttggacg gagaaactaa 780 
cctgaagaca catgtggcag ttccagaaac agcattatta caaacagttg ccaatttgga 840 
cactctagta gctgtaatag aatgccagca accagaagca gacttataca gattcatggg 900 
acgaatgatc ataacccaac aaatggaaga aattgtaaga cctctggggc cggagagtct 960 
cctgcttcgt ggagccagat taaaaaacac aaaagaaatt tttggtgttg cggtatacac 1020 
tggaatggaa actaagatgg cattaaatta caagagcaaa tcacagaaac gatctgcagt 1080
agaaaagtca atgaatacat ttttgataat ttatctagta attcttatat ctgaagctgt 1140
catcagcact atcttgaagt atacatggca agctgaagaa aaatgggatg aaccttggta 1200
taaccaaaaa acagaacatc aaagaaatag cagtaaggta gagtacgtgt ttacagataa 1260
aactggtaca ctgacagaaa atgagatgca gtttcgggaa tgttcaatta atggcatgaa 1320 
ataccaagaa attaatggta gacttgtacc cgaaggacca acaccagact cttcagaagg 1380
aaacttatct tatcttagta gtttatccca tcttaacadd ttatcccatc ttacaaccag 1440
ttcctctttc agaaccagtc ctgaaaatga aactgaacta attaaagaac atgatctctt 1500
ctttaaagca gtcagtctct gtcacactgt acagattagc aatgttcaaa ctgactgcac 1560 
tggtgatggt ccctggcaat ccaacctggc accatcgcag ttggagtact atgcatcttc 1620 
accagatgaa aaggctctag tagaagctgc tgcaaggatt ggtattgtgt ttattggcaa 1680 
ttctgaagaa actatggagg ttaaaactct tggaaaactg gaacggtaca aactgcttca 1740 
tattctggaa tttgattcag atcgtaggag aatgagtgta attgttcagg caccttcagg 1800 
tgagaagtta ttatttgcta aaggagctga gtcatdaatt ctccctaaat gtataggtgg 1860
agaaatagaa aaaaccagaa ttcatgtaga tgaatttact ttgaa^gggc taagaactct 1920 
Sftgtatagca tatagaaaat ttacatcaaa agagtat^ag gaaatagata aacgcatatt 1980 
tgaagccagg actgccttgc agcagcggga agagaaattg gcagctgttt tccagttcat 2040 
agagaaagac ctgatattac ttggagccac agcagtagaa gacagactac aagataaagt 2100
tcgagaaact attgaagcat tgagaatggc tggtatcaaa gtatgggtac ttactgggga 2160 
taaacatgaa acagctgtta gtgtgagttt atcatgtggc pattttcata gaaccatgaa 2220
catccttgaa cttataaacc agaaatcaga cagcgagtgt gctgaacaat tgaggcaget 2280
tgccagaaga attacagagg atcat^tgat tcagcatggg ctggtagtgg atgggaccag 2340
cctatctctt gcactcaggg agcatgaaaa actatttatg gaagtttgca gaaattgttc 2400 
agctgtatta tgctgtcgta tggctccact gcagaaagca aaagtaataa gactaataaa 2460
aatatcacct gagaaaccta taacattggc tgttggtgat ggtgctaatg acgtaagcat 2520 
gatacaagaa gcccatgttg gcataggaat catgggtaaa gaaggaagac aggctgcaag 2580
aaacagtgac tatgcaatag ccagatttaa gttcctctcc aaattgcttt ttgttcatgg 2640
tcatttttat tatattagaa tagctaccct tgtacagtat tttttttata agaatgtgtg 2700 
ctttatcaca ccccagtttt tatatcagtt ctactgtttg ttttctdagc aaacattgta 2760 
tgacagcgtg tacctgactt tatacaatat ttgttttact tccctaccta ttctgatata 2820
tagtcttttg gaacagcatg tagaccctca tgtgttacaa aataagccca ccctttatcg 2880 
agacattagt aaaaaccgcc tcttaagtat taaaacattt ctttattgga ccatcctggg 2940
cttcagtcat gcctttattt tcttttttgg atcctattta ctaataggga aagatacatc 3000
tctgcttgga aatggccaga tgtttggaaa ctggacattt ggcactttgg tcttcacagt 3060 
catggttatt acagtcacag taaagatggc tctggaaact catttttgga cttggatcaa 3120 
ccatctcgtt acctggggat ctattatatt ttattttgta ttttccttgt tttatggagg 3180 
gattctctgg ccatttttgg gctcccagaa tatgtatttt gtgtttattc agctcctgtc 3240 
aagtggttct gcttggtttg ccataatcct catggttgtt acatgtctat ttcttgatat 3300
cataaagaag gtctttgacc gacacctcca ccctacaagt actgaaaagg cacagcttac 3360 
tgaaacaaat gcaggtatca agtgcttgga ctccatgtgc tgtttcccgg aaggagaagc 3420 
agcgtgtgca tctgttggaa gaatgctgga acgagttata ggaagatgta gtccaaccca 3480 
catcagcagg tgtgaaatct ctctaagtag cctttgctgc agatgagtat cctatctgga 3540 
acaggatgaa cctgccgctc tagataccta ataaatcagc agctggtttt accaactgaa 3600 
gcaggaagtc tgctatttat tagcactctt tggtggtaga tttcactttg tggctttggg 3660 
gt^agggctt tttcactcac aaaggaagag aaagcacctt tgaagagact tcatctaatg 3720 
aacaaaaaat tttgtttcat aatctttcta aaatgtgctc agtaggagtg tgtttatggt 3780 
actcttttat ggtttgtata actttctttt ttaaattata catatactat ttccttttta 3840
tttttttaaa atttttttgc tttttgtctt tacaaaataa fectcaacata acagtgaagt 3900 
caaaggcttt ccttttctta ctctgtatgt atattttcca gttggttatt tgaggctttg 3960
aggtatttat aaacacaaaa ggctgtattt ctgctcccct acctcttctt atgtctgtaa 4020 
tgaagttttg aaatgagtca tgatttttaa gtttcttttg cttggtattt attgcctaat 4080 
taaaagtgta tgagttagaa caggcttttt ^aattatgga gtaaaagaat cttagcattt 4140 
ttgtcccctc ctaaatctgt ttcttgaatg agatttatca ccatgcctgc tgttgtgcac 4200 
cataacgaaa aaaaacacct tttggtaaac accatttaaa attca 4245

<210> 53
<211> 2124
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7477720CB1

<400> 53

atggctctgc agatgttcgt gacttacagt ccttggaatt gtttgctact gctagtggct 60
cttgagtgtt ctgaagcatc ttctgatttg aatgaatctg caaattccac tgctcagtat 120
gcatctaacg cttggtttgc tgctgccagc tcagagccag aggaagggat atctgttttt 180
gaactggatt atgactatgt gcaaattcct tatgaggtca ctctctggat acttctagca 240
tcccttgcaa aaataggctt ccacctctac cacaggctgc caggcctcat gccagaaagc 300
tgcctcctca tcctggtggg g^fcgctggtg ggcggcttca tcttcggcad cgaccacaaa 360
tcgcctccgg tcatggactc cagcatctac ttcctgtatc tcctgccacc catcgttctg 420
gagggcggct acttcatgcc cacqcggccc ttctttgaga acatcggctc catcc^gtgg 480
tgggcagtat tgggggccct gatcaacgcc ttgggcattg gcctctccct ctacctcatc 540
tgccaggtga aggcctttgg cctgggcgac gtcaacctgc tgcagaacct gctgttcggc 600
agcctgatct ccgccgtgga cccagtggcc gtgctagccg tgtttgagga agcgcgcgtg 660
aacgagcagc tctacatgat gatctttggg gaggccctgc tcaatgatgg cattactgtg 720
gtcttataca atatgttaat tgcctttaca aagatgcata aatttgaaga catagaaact 780
gtcgacattt tggctggatg tgcccgattc atcgttgtgg ggcttggagg ggtattgttt 840
ggcatcgttt ttggatttat ttctgcattt atcacacgtt tcactcagaa tatctctgca 900
attgagccac tcatcgtctt catgttcagc tatttgtctt acttagctgc tgaaaccctc 960
tatctctccg gcatcctggc aatqacagcc tgcgcagtaa caatgaaaaa gtacgtggaa 1020
gaaaacgtgt cccagacatc atacacgacc atcaagtact tcatgaagat gctgagcagc 1080
gtcagcgaga ccttgatctt catcttcatg ggtgtgtcca ctgtgggcaa gaatcacgag 1140
tggaactggg ccttcatctg cttcaccctg gccttctgcc aaatctggag agccatcagc 1200
gtatttgctc tcttctatat cagta^ccag tttcggactt tccccttctc catcaaggac 1260
cagtgcatca ttttctackg tggtgttcga ggagctggaa gtttttcact tgcatttttg 1320
cttcctctgt ctctttttcc taggaagaaa atgtttgtca ctgctactct agtagttata 1380
tactttactg tatttattca gggaatcaca gttggccctc tggtcaggta cctggatgtt 1440
aaaaaaacca ataaaaaaga atccatcaat gaagagcttc atattcgtct gatggatcac 1500
ttaaaggctg gaatcgaaga tgtgtgtggg cactggagtc actaccaagt gagagacaag 1560
tttaagaagt ttgatcatag atacttacgg aaaatcctca tcagaaagaa cctacccaaa 1620
tcaagcattg tttctttgta caagaagctg gaaatgaagc aagccatcga gatggtggag 1680
actgggatac tgagctctac agctttctcc ataccccatc aggcccagag gatacaagga 1740
atcaaaagac tttcccctga ag^tgit^gag tccataaggg acattctgac atccaacatg 1800
taccaagttc ggcaaaggac cctgtcctac aacaaataca acctcaaacc ccaaac^agt 1860
gagaagcagg ctaaagagat tctgatccgc cgccagaaca ccttaaggga gagcatgagg 1920
aaaggtcaca gcctgccctg gggaaagccg gctggcacca agaatatacg ctacctctcc 1980
tacccctacg ggaatcctca gtctgcagga agagacacaa gggctgctgg gttctcaggt 2040
aagctgccca cctggctgct ctgctgcttt tctgtagagt caggtggtaa atatctgggg 2100
gtgtgggcca agaggcaaca ttaa 2124



<210> 54
<211> 2195
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7477852CB1

<400> 54

atggggggtt ttctacctaa ggcagaaggg cccgggagcc aactccagaa acttctgccc 60
tcctttctgg tcagagaaca agactgggac cagcacctgg acaagcttca tatgctgcag 120
cagaagagga ttctagagtc tccactgctt cgagcatcca aggaaaatga cctgtctgtt 180
cttaggcaac ttctactgga ctgcacctgt gacgttcgac aaagaggagc cctgggggag 240
acggcgctgc acatagcagc cctctatgac aacttggagg cggccttggt gctgatggag 300
gctgccccag agctggtctt tgagcccacc acatgtgagg cttttgcagg tcagactgca 360
ctgcacatcg ctgttgtgaa ccagaatgtg aacchggtgc gtgccctgct cacccgcagg 420
gccagtgtct ctgccagagc cacaggcact gccttccgcc gtagtccccg caacctcatc 480
tactttggtg agcacccttt gtcctttgct gcctgtgtga acagcgagga gatcgtgcgg 540
ctgctcattg agcatggagc tgacatcagg gcccaggact ccctgggtaa cacagtatta 600
cacatcctca tcctccagcc caacaaaacc tttgcctgcc agatgtacaa cctgctgctg 660
tcctacgatg gacatgggga ccacctgcag cccctggacc ttgtgcccaa tcaccagggt 720
ctcaccccct tcaagctggc tggagtggag ggtaacactg tgatgttcca gcacctgatg 780
cagaagcgga ggcacatcca gtggacgtat ggacccctga cctccattct ctacgacctc 840
acagagatcg actcctgggg agaggagctg tccttcctgg agcttgtggt ctcctctgat 900
aaacgagagg ctcgccaaat tctggaacag accccagtga aggagctggt gagcttcaag 960
tggaacaagt atggccggcc gtacttctgc atcctggctg ccttgtacct gctctacatg 1020
atctgcttta ccacgtgctg cgtctaccgc ccccttaagt ttcgtggtgg caaccgcact 1080
cattctcgag acatcaccat cctccagcaa aaactactac aggaggccta tgagacacgt 1140
gaagatatca tcaggctggt gggggagctg gtgagcatcg ttggggctgt gatcatcctg 1200
ctcctagaga ttccagacat cttcagggtt ggtgcctctc gctattttgg aaagacgatt 1260
cttggggggc cattccatgt catcatgatc acctatgcct ccctggtgct ggtgaccatg 1320
gtgatgcggc tcaccaacac caatggggag gtggtgccca tgtcctttgc cctggtgctg 1380
ggct^gtgca gtgtcatgta tttcactcga ggattccaga tgctgggtcc cttcaccatc 1440
atgatccaga agatgatttt tggagaccta atgcgtttct gctggctgat ggctgtggtc 1500
atcttgggat ttgcctccgc gttctatatc attttccaga cagaggaccc aaccagtctg 1560
gggcaattct atgactaccc catggcactg ttcaccacct ttgagctttt tctcactgtt 1620
attgatgcac ctgccaacta cgacgtggac ttgcccttca tgttcagcat tgtcaacttc 1680
gccttcgcca tcattgccac actgctcatg ctcaacttgt tcatc^ccHt gatgggcgac 1740
acccactgga gggtggccca ggagagggat gagctctgga gggcccaggt cgtggccacc 1800
acagtgatgc tggagcggaa gctgcctcgc tgcctgtggc ctcgatccgg gatctgtggg 1860
tgcgaattcg ggctggggga ccgctggttc ctgcgggttg agaaccacaa tgatcagaat 1920
cctctgcgag tgcttcgcta tgtggaagtg ttcaagaact cagacaagga ggatgaccag 1980
gagcatccat ctgagaaaca gccctctggg gctgagagtg ggactctagc cagagcctct 2040
ttggctcttc caacttcctc cctgtcccgg accgcgtccc agagcagcag tcaccgaggc 2100
tgggagatcc ttcgtcaaaa caccctgggg cacttgaatc ttggactgaa ccttagtgag 2160
ggggatggag aggaggtcta ccatttttga ttaac 2195

<210> 55
<211> 2055
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 1471717CB1

<400> 55

cggctctggg ccctcagcct ggctcatgca caactgtctg aagtgctctg gactatggtg 60 
atgaacagcg gccttcagac gcgaggctgg ggaggaatcg tcggggtttt tattattttt 120 
gccgtatttg ctgtcctgac agtagccatc cttctgatca tggagggcct ctctgcttto 180 
ctgcacgccc tgcgactgca ctgggaagct gtttggggaa cttgagctat ttagaagatg 240 
gcaaccaagc caacagagcc tgtcacgatc ctcagccttc ggaaattgag cctggggacc 300 
gcagagccac aggttaaaga gccaaagacg ttcaccgtgg aagatgcagt ggagactatc 360
ggcttcgggc gtttccacat tgccctcttt ctgatcatgg gcagtactgg ggtggttgag 420 
gccatggaga tcatgttgat agctgttgtg tctcctgtca tccgctgtga atggcaactg 480 
gagaattggc aggtggcatt agtaaccacg atggtgtttt ttggctacat ggttttcagt 540 
atcctctttg gcctcctggc tgacagatat ggccgctgga agattctgct catctcgttc 600
ctgtggggag cctatttctc cttgctgacc tcgtttgctc cttcgtacat ctggtttgtc 660
ttcctgcgga cgatggtggg ctgtggtgtg tccggccact cgcaagggtt aatcataaag 720 
actgaatttt tgcccacgaa ataccgaggc tatatgttac ccttgtctca ggtgttctgg 780
cttgcgggct ccctgctcat cattggcttg gcctctgtga tcatccccac catcgggtgg 840 
cgctggctca ttcgcgtcgc ctccatcccg ggcatcatcc tcatcgtggc cttcaagttt 900 
attcctgaat ctgcccggtt caatgtctcc actgggaaca ctcgggctgc cctggccact 960 
ctggagcgcg ttgccaagat gaaccgctcg gtcatgccgg aggggaagct ggtggagccc 1020
gtcctggaaa aaagaggaag atttgcagac ctattggatg ctaaatattt acggaccaca 1080 
ttacagatct gggtcatatg gcttggaatc tcttttgcct actatggggt tatcctggcc 1140 
agtgctgagc tgctggagcg ggacttggtc tgtggttcaa agtcagactc tgcggtggtg 1200 
gtgactgggg gggactcagg ggagagccag agcccctgct actgccacat gtttgcaccc 1260 
tctgactatc ggaccatgat catcagcacc atcggtgaaa ttgctttgaa tcctttaaat 1320 
atactgggca tcaatttcct gggaagacgg ctgagccttt ctattaccat gggatgcacg 1380 
gctttattct gccttctcct caacatttgc acttcaagtg ccggcctgat tggcttcctc 1440 
ttcatgctga gggctctggt agctgcaaac ttcaacaccg tctacattta cacagctgag 1500 
gtctacggca ccacgatgcg cgctttgggg atgggaacca gcggctccct gtgtcgcatt 1560
ggtgcaatgg tggcgccatt tatatcqcag gttcttatga gtgcatcaat actgggggcc 1620 
ctgtgtctct tctcatctgt ctgtgttgta tgcgccattt ctgcattcac tctccccatc 1680 
gaaaccaaag gacgggccct ccagcaaatt aaatgaagac ctgcaaagct atgtctacca 1740 
gatgagaaaa atgaattcta tcttcagaac tgcggtgcat ttttttaaaa cttggtttta 1800 
cttctgtatg ctactcggta attagtaaag tgattttttt ttaaaaggca tatatgggaa 1860 
tggggtaggt aactgtatat tgatctcttc cttgaggaac aatatataaa gtacttttat 1920 
aaaatataat ttaagctttc aaaggggtgt gagagggaga tggtgggggg gaagatggct 1980 
tttcttcgtt gaaatcaagt ctgtaaacct ttatatgaat aaatactaaa ttttaaactt 2040 
acaaaaaaaa aaaaa 2055 

<210> 56
<211> 4727
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 3874406CB1

<400> 56

aagagctgct ggagtaggca cccatttaaa gaaaaaatga agaagcagca ataaagaagt 60
tgtaatcgtt acctagacaa acagagaact ggttttgaca gtgtttctag agtgcttttt 120
attattttcc tgacagttgt gttccaccat gattactttc tccttcagcg aataggctaa 180
atgaatatga aacagaaaag cgtgtatcag caaaccaaag cacttdtgtg caagaatttt 240 
cttaagaaat ggaggatgaa aagagagagc ttattggaat ^gggcctctc aatacttcta 300 
ggactgtgta ttgctctgtt ttccagttcc atgagaaatg tccagtttcc tggaatggct 360
cctcagaatc tgggaagggt agataaattt aatagctctt ctttaatggt tgtgtataca 420 
ccaatatcta atttaaccca gcagataatg aataaa^cag cacttgctcc tcttttgaaa 480
ggaacaagtg tcattggggc accaaataaa acacacatgg acgaaatact tctggaaaat 540 
ttaccatatg ctatgggaat catctttaat gaaactttpt cttataagtt aatatttttc 600
cagggatata acagtccact ttggaaagaa gatttctcag ctcattgctg ggatggatat 660
ggtgagtttt catgtacatt gaccaaatac tggaatagag gatttgtggc tttacaaaca 720
gctattaata ctgccattat agaagtagct ttggtgttcc tgatgagtgt gctgttaaag 780
aaagctgtcc tcaccaattt ggttgtgttt ctccttaccc tcttttgggg atgtctggga 840
ttcactgtat tttatgaaca acttccttca tctctggagt ggattttgaa tatttgtagc 900
ccttttgcct ttactactgg aatgattcag attatcaaac tggattataa cttgaatggt 960 
gtaatttttc ctgacccttc aggagactca tatacaatga tagcaacttt ttctatgttg 1020 
cttttggatg gtctcatcta cttgctattg gcattatact ttgacaaaat tttaccctat 1080 
ggagatgagc gccattattc tcctttattt ttcttgaatt catcatcttg tttccaacac 1140
caaaggacta atgctaaggt tattgagaaa gaaatcgatg ctgagcatcc ctctgatgat 1200
tattttgaac cagtagctcc tgaattccaa ggaaaagaag ccatcagaat cagaaatgtt 1260 
aagaaggaat ataaaggaaa atctggaaaa gtggaagcat tgaaaggctt gctctttgac 1320
atatatgaag gtcaaatcac ggcaatcctg ggtcacagtg gagctggcaa atcttcactg 1380
ctaaatattc ttaatggatt gtctgttcca acagaaggat cagttaccat ctataataaa 1440
aatctctctg aaatgcaaga cttggaggaa atcagaaaga taactggcgt ctgtcctcaa 1500 
ttcaatgttc aatttgacat actcaccgtg aaggaaaacc tcagcctgtt tgctaaaata 1560 
aaagggattc atctaaagga agtggaacaa gaggtacaac gaatattatt ggaattggac 1620 
atgcaaaaca ttcaagataa ccttgctaaa catttaagtg aaggacagaa aagaaagctg 1680 
acttttggga ttaccatttt aggagatcct caaattttgc tcttagatga accaactact 1740 
ggattggatc ccttttccag agatcaagtg tggagcctcc tgagagagcg tagagcagat 1800 
catgtgatcc ttttcagtac ccagtccatg gatgaggctg acatcctggc tgatagaaaa 1860
gtgatcatgt ccaatgggag actgaagtgt gcaggttctt ctatcttttt gaaaagaagg 1920 
tggggtcttg gatatcacct aagtttacat aggaatgaaa tatgtaaccc agaacaaata 1980 
acatccttca ttactcatca catccccgat gctaaattaa aaacagaaaa caaagaaaag 2040
cttgtatata ctttgccact ggaaaggaca aatacatttc cagatctttt cagtgatctg 2100
gataagtgtt ctgaccaggg agtgacaggt tatgacattt ccatgtcaac tctaaatgaa 2160 
gtctttatga aactggaagg acagtcaact atcgaacaag atttcgaaca agtggagatg 2220 
ataagagact cagaaagcct caatgaaatg gagctggctc actcttcctt ctctgaaatg 2280 
cagacagctg tgagtgacat gggcctctgg agaatgcaag tctttgccat ggcacggctc 2340 
cgtttcttaa agttaaaacg tcaaactaaa gtgttattga ccctattatt ggtatttgga 2400 
atcgcaatat tccctttgat tgttgaaaat ataatatatg ctatgttaaa tgaaaagatc 2460 
gattgggaat ttaaaaacga attgtatttt ctctctcctg gacaacttcc ccaggaaccc 2520 
cgtaccagcc tgttgatcat caataacaca gaatcaaata tcgaagattt tataaaatca 2580
ctgaagcatc aaaatatact tttggaagta gatgactttg siaaacagaaa tggtactgat 2640 
ggcctctcat acaatggagc tatcatagtt tctggtaaac aaaaggatta tagattttca 2700
gttgtgtgta ataccaagag attgcactgt tttccaattc ttatgaatat tatcagcaat 2760 
gggctacttc aaatgtttaa tcacacacaa catattcgaa ttgagtcaag cccatttcct 2820 
cttagccaca taggactctg gactgggttg ccggatggtt cctttttctt atttttggtt 2880 
ctatgtagca tttctcctta tatcaccatg ggcagcatca gtgattacaa gaaaaatgct 2940 
aagtcccagc tatggatttc aggcctctac acctctgctt actggtgtgg gcaggcacta 3000
gtggacgtca gcttcttcat tttaattctc cttttaatgt atttaatttt ctacatagaa 3060
^acatgcagt accttcttat tacaagccaa attgtgtttg ctttggttat agttactcct 3120 
ggttatgcag cttctcttgt cttcttcata tatatgatat catttatttt tcgcaaaagg 3180 
agaaaaaaca gtggcctttg gtcattttac ttcttttttg cctccaccat catgttttcc 3240 
atcactttaa tcaatcattt tgacctaagt atattgatta ccaccatggt attggttcct 3300
tcatatacct tgcttggatt taaaactttt ttggaagtga gagaccagga gcactac^ga 3360
gaatttccag aggcaaattt tgaattgagt gccactgatt ttctagtctg cttcataccc 3420 
tactttcaga ctttgctatt cgtttttgtt ctaagatgca tggaactaaa atgtggaaag 3480 
aaaagaatgc gaaaagatcc tgttttcaga atttcccccc aaagtagaga tgctaagcca 3540 
aatccagaag aacccataga tgaagatgaa gatattcaaa cagaaagaat aagaacagtc 3600
actgctctga ccacttcaat cttagatgag aaacctgtta taattgccag ctgtctacac 3660 
aaagaatatg caggccagaa gaaaagttgc ttttcaaaga ggaagaagaa aatagcagca 3720 
agaaatatct ctttctgtgt tcaagaaggt gagattttgg gattgctagg acccagtggt 3780
gctggaaaaa gttcatctat tagaatgata tctgggatca caaagccaac tgctggagag 3840
gtggaactga aaggctgcag ttcagttttg ggccacctgg ggtactgccc tcaagagaac 3900
gtgctgtggc ccatgctg^p gbtgagggaa cacqtggagg tgtatgctgo cgtcaagggg 3960
ctcagggaag cggacgcgag gctcgccatc gcaagattag tgagtgcttt caaactgcat 4020
gagcagctga atgttcctgt gcagaaatta acagcaggaa tcacgagaaa gttgtgtttt 4080
gtgetgagec tcctgggaaa ctcacctgtc ttgctcctgg atgaaccatc tacgggcata 4140
gaccccacag ggcagcagca aatgtggcag gcaatccagg cagtcgttaa aaacacagag 4200
agaggtgtcc tcctgaccac ccataacctg gctgaggcgg aagccttgtg tgaccgtgtg 4260
gccatcatgg tgtctggaag gcttagatgc attggctcca tccaacacct gaaaaacaaa 4320
cttggcaagg attacattct agagctaaaa gtgaaggaaa cgtctcaagt gactttggtc 4380
cacactgaga ttctgaagct tttcccacag gctgcagggc agcaaaggta ttcctctttg 4440
ttaacctata agctgcccgt ggcagacgtt taccctctat cacagacctt tcacaaatta 4500
gaagcagtga agcataactt taacctggaa gaatacagcc tttctcagtg cacactggag 4560
aaggtattct tagagctttc taaa^aacag gaagtaggaa attttgatga agaaattgat 4620
acaacaatga gatggaaact cctccctcat tcagatgaac cttaaaacct caaacctagt 4680
aattttcttg cttg^tctcc tataaactta tgttttatgt aataatt 4727



<210> 57
<211> 3852
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 4599654CB1

<400> 57

cgccggcgat 

gctctggagg 

agcggggtgg 

acccctggac 

ccgatcccca 

acggtcaaca 

gagcgggtga 

tgggacctga 

accttcttca 

ttcttcctac 

gagatcctgc 

gacctcatct 

ttggacgctg 

ctaagcctgc 

gagatcttcc 

gggatgatgc 

caggacttcc 

cgccagtatt 

cagcaggcac 

gccacatgct 

tcccggcgtc 

ctgccagcag 

atgttcgatg 

aacttcacct 

ttcgtcactg 

gtgcgtgagg 

ctggcccgcg 

ctgctaacta 

tcactcagcg 

tttgagactg 

cggaagcgct 

caacatgaca 

cttagtggaa 

gtgacctcca 

gactctccag 

ccgctggtgc 

gcccgaaccc 

ccccctccag 

caaccctctc 

agtgagcggc 



tccgagccta 
ggttgcgggt 
gcgccatgga 
tggaggcggt 
aatctgggcc 
agttctccct 
agtcagcggg 
tcatgctgct 
aggaggagaa 
tggatctggt 
tggcaccgcg 
cttctatccc 
aggtctacaa 
tgaggctgct 
acatgaccta 
tgctgctatg 
ctcccgactg 
cccatgccct 
ctgtaggcat 
acgccatgtt 
agtaccagga 
acacgcggca 
aggaaagcat 
gtcggggcct 
cagttctcac 
gctccgtggg 
gcgcccggga 

ggggccggcg 

tggaccattt 
tggccatgga 

ccgagccaag 
gagacatggc 
agccagtact 
atgtggccat 
ccaccctcct 
ccgtccgagc 
tgcacgccag 
gaggaggtgg 
tgcctcagcg 
tgcctccctc 



cgacgcctcc
acctgatggc 
ggcagagcag 
gcctcccgtt 
tgagcctaag 
tcgggtgttc 
ggcctggatc 
gctgatggtg 
ctccccgcct 
gctcaacttc 
ggccatccgc 
tgtggattac 
aacggcacgg 
ccgcctctcc 
tgacctggcc 
tcactgggat 
ctgggtctcc 
gttcaaggcc 
gcccgacgtc 
catcggccat 
gaagtacaag 
gcgcatccac 
cctgggcgag 
ggtggcccac 
caagctgcgc 
gaggaagatg 
cacacgcctc 
cacagccagt 
caatgctgtg 
tcggctgctc 
tccaggcagc 
tcggggtgtt 
gtgggagcca 
tgccctgact 
tgctcgctct 
tggcccatgg 
cctatcccgg 
acggcggcta 
ggcaacaggc 
agggctcctg 



gctagagccc 
cacagagggc 
cggccggcgg 
gctcccccgc 

ggcagccaca 
atccacccct 
gggaacctca 
tggatcgtct 
cgaacgggca 
acgcgctacc 
atcttcctag 
gccctacgca 
cgcctcatcc 
agtgctgtgg 
ggGtgtctgc 
atcaaccaca 
atgagccaca 
tggctcacca 
gccacggcac 
caggtggagc 
gagtactatg 
ctgagcgagc 
atgccgctgt 
tttgaggtct 
tacttcatcc 
accgatggat 
gttcgggctg 
cttgaggagt 
cgcatcggca 
agtggtggca 
cggggtcggg 
ctggtacatg 
catcagcggg 
gcttggcgct 
gcatccacct 
gcagggcgct 
ggacctcggg 
gatggctctc 
gccaaacctc 



gcggggctgc 
tctaggaggc 
cgggggccag 
ctgcgaccgc 
ttgggacgct 
aagcagtgga 
acagcgactt 
tcgtcctgcc 
tcaacgtatt 
tcgtggtgga
tgcgcacctg 
tggtggagct 
tcgttcgctt 
gctacataca 
ttcgcatctt 
agttcctggt 
tg^tgaacca 
tgctgtgcat 
tgctcagcat 
t^atccagtc" 
agtacatgtc 
agcaccgcta 
cgcttcgcga 
ttgcccatgc 
tccagccggg 
agcatgggct 
Gctactttgg 
acacctactg 
tccccatgat 
agaagaattc 
tcatggagca 
ccccgagcac 
cgccccttca 
gccctctgcc 
cagcaggctc 
cccgcctgcc 
cccaggtctc 
gccgcccact 
ctgggcgtaa 
caaggacagc 



gccgactcct 
cgagcgtgta
cgaaggggcg 
ggcctcaggt 
gctccagcct 
aatcgagcag 
ccggttttac 
tgtgggcatc 
gtctgatact 
ggagg&tgct 
gttcctggtt 
ggagccacgg 
caccaagatc 
ccagtgggag 
caacctcatt 
gcccatgctg 
ctcgtggggc 
tggctatggg 
gatcgtaggt 
cctggactct 
cttccacaag 
ccagggcaag 
ggagatcatt 
cgaccccagc 
ggatctcgtg 
gctcagtgtg 
ggagatctgc 
ccgcctttac 
gcgccgggcc 
catactgcag 
gcatttggtg 
aggagctcag 
ggcagctgct 
cctctcccct 
tccagcttcc 
cgccccacct 
cctgctgggt 
ctcagcctcc 
gggatcagga 
ccagcccccc 



60 • 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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aggccaccag tgcctgagcc agccacaccc cggggtctcc agctttctgc caacatgtaa 2460
aacctttgag tacatccagc cttagttctt ggggtgcagt agtatgtacc caagggcaga 2520
tgcctcttgg ggaaggccat g^i&gacctga aacattgccc catggaaat^ tcgaccctgt 2580
gcggacattc cgcatactgc catgaagacg gtctctgtgt cctcagctca agaatectgt 2640
agcttgtccc atcataatcc attcjacccgt tcatcatgtg tactgagcag Gtaccptgtt 2700
caaggtaata tgccaggcgc tgtatgtctc cactgccaag tagaagtgac tcadaaccct 2760
ctgacaagga itattcccttg gctatggtcc tgccaggtgc aggcccaggc ccatgacccc 2820
acctttacta agcacaagta cttgccactg ccatcacfcgc caagtaacta gatgtctctg 2880
tttccctgcc aatgatcctg caggttctgc ccggtctggt tatcttcctg tttcctgtag 2940
catagccagg cactgccagt cacctgtgcc cccattgctg tcagcagatg tcttgggtcc 3000
tgagtgtggg tatccacttt tacccgctca Gtgccacctg tggacactct gtgtctaccc 3060
tctgagtggg aacatacttc taagttccct gcagtctctg tcotgtggta gaccatcttt 3120
ttgtaaactg cgagcttcct cttccctgta ccctctgccc cagtcgtgac cccctaaaag 3180
ttaaggggta gttggcacct ccttattaat Atgccagcct agatcccccc cggtggaggg 3240
gcaaatggct gaatccttgt gtgatatttt tttcttcgct tgtttattta ttcatttatt 3300
taattgtatt tattcattta ctaactttat gtgttaccaa ttaattttgt ttacccattc 3360
ctttatccat ccctcccctc Gttttcaggt aaggagacag gaggagtagg aggaggcagg 3420
gcctctccat gccagcctct gtggtccttg cccaaaccca tcagcgcaat acttgaacct 3480
tctcccaggt aggggc^g&a ggagccacat gagagaggga gaaggaccgc gtttaccttt 3540
agagttttgt tttgtttttt ccttctgagt ttgctgttgg tgcaggaata agggaaaggc 3600
ccaaggtatic caagcctggg gaagggcagg ccagccagca cctctgcctt ctcagggaca 3660
agagtagtcc tttaccaccc tcaetctgcc tgtcccctct cctactctac agcattaaag 3720
actgtgggac caggacccta agtctccttt ccttctgggt ggggagttct aggggttctt 3780
ggtgtgtggg agaagtttta taattgcttc caa^.cagctg ggtttaiaat& taaaatagac 3840
acactcaaaa aa 3852



<210> 58 
<211> 1917 
<212> DNA
<213> Homo sapiens 

<220> 

<221> misc_feature
<223> Incyte ID No: 5047435CB1



<400> 58 

atggcagaag gtgaaagggg agcagacgtg ccacatggcc tcggggcctg gctggccgac 60
gtggcgttgg cggcgctgcg cgcgggaggg cagggcagga gggacagagg cgggggcggg 120
ccggaaagtt tgtccggcgg cagcggcgtt ggggactccg gcgggggatg cgcgcGGggc 180
ccctcagcgc ccccagcacg ccgccgagtc ccgctcgcca tgggccactc cccacctgtG 240
ctgcctttgt gtgcctctgt gtctttgctg ggtggcctga cctttggtta tgaactggca 300
gtcatatcag gtgccctgct gccactgcag cttgactttg ggctaagctg Gttggagcag 360
gagttcctgg tgggcagcct gctcctgggg gctctcctcg cctccctggt tggtggGttc 420
ctcattgact gctatggcag gaagcaagcc atcctcggga gcaaattggt gctgctggca 480
ggcagcctga ccctgggcct ggctggttcG ctggcctggc tggtcctggg ccgcgctgtg 540
gttggcttcg ccatttccct ctcctccatg gcttgctgta tctacgtgtc agagctggtg 600
gggccacggc agcggggagt gctggtgtcc ctctatgagg Gaggcatcac cgtgggGatc 660
ctgctctcct atgccctcaa ctatgcactg gctggtaccc Gctggggatg gaggcacatg 720
ttcggctggg ccactgcacc tgctgtcctg caatccctca gcctcctctt cctccctgct 780
ggtacagatg agactgcaac acacaaggac ctcatcccac tccagggagg tgaggccccc 840
aagctgggcc cggggaggcc acggtactcc tttctggacc tcttcagggc acgcgataac 900
atgcgaggcc ggaccacagt gggcctgggg ctggtgctct tccagcaaLct aacagggcag 960
cccaacgtgc tgtgctatgc ctccaccatc ttcagctccg ttggtttcca tsrergggatcc 1020
tcagccgtgc tggcctctgt ggggcttggc gcagtgaagg tggcagctac cctgaccgcc 1080
atggggctgg tggaccgtgc aggccgcagg gctctgttgc tagctggctg tgccctcatg 1140
gccctgtccg tcagtggcat aggcctcgtc agctttgccg tgcccatgga GtcaggGcca 1200
agctgtctgg ctgtgcccaa tgccaccggg cagacaggcc tcGGtggaga ctctggoctg 1260
ctgcaggact cctcfcctacc tcccattcca aggaccaatg aggaGcaaag ggagccaatG 1320
ttgtccactg ctaagaaaac caagccccat cccagatctg gagaGCGctc agcccGtcct 1380
cggctggccc tgagctctgc cctccctggg ccccctctgc ccgGtGgggg gGatgGactg 1440
ctgcgctgga ccgcactgct gtgcctgatg gtctttgtca gtgGcttctG Gtttgggttt 1500
gggccagtga cctggcttgt cctcagcgag atctaccctg tggagatacg aggaagagGG 1560
ttcgccttct gcaacagctt caactgggcg gccaacctct toatcagcct ctccttcctc 1620
gatctcattg gcaccatcgg cttgtcctgg accttcctgc tctacggact gaccgctgtc 1680
ctcggcctgg gcttcatcta tttatttgtt cctgaaacaa aaggccagtc gttggcagag 1740
atagaccagc agttccagaa gagacggttc accctgagct ttggccacag gcagaactcc 1800
actggcatcc cgtacagccg catcgagatc tctgcggcct cctgaggaat ccgtctgcct 1860
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ggaakttctg gaactgtggc tttggcagac catctccagc atcctgcttc ctaggcc 1917 

<210> 59
<211> 6791
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7475603CB1
<400> 59

cgcgctccct gcctgctgct gggcggaggg aaggcggcaa gagctgcg^a gcccctggaa 60
gagcttccag gaaccctgcg ctgtgggata aaggaatgag gttcagaaag gggcaggagt 120
tgcccgcagc cgcaccgcac gtcttcagcc cgaccgttgt cctgacctct ctgtcccgtc 180
ccctgcccag tctcaccatg gccttctgga cacagctgat gctgctgctc tggaagaatt 240 
tcatgtatcg ccggagacag ccggtccagc tcctggtcga attgctgtgg cctctcttcc 300 
tcttcttcat cctggtggct gttdgccact cccacccgcc cctggagcac catgaatgcc 360
acttcccaaa caagccactg ccatcggcgg gcaccgtgcc ctggctccag ggtctcatct 420 
gtaatgtgaa caacacctgc tttccgcagc tgacaccggg cgaggagccc gggcgcctga 480
gcaacttcaa cgactccctg gtctcccggc tgctagccga tgcccgcact gtgctgggag 540 
gggccagtgc ccacaggacg ctggctggcc tagggaagct gatcgccaqg ctgagggctg 600
cacgcagcac ggcccagcct caaccaacca agcagtctcG actggaacca cccatgctgg 660
atgtcgcgga gctgctgacg tcactgctgc gcacggaatc cctggggttg gcactgggcc 720
aagcccagga gcccttgcac agcttgttgg aggccgctga ggacctggcc caggagctcc 780 
tggcgctgcg cagcctggtg gagcttcggg cactgctgca gagaccccga gggaccagcg 840 
gccccctgga gttgctgtca gaggccctct gcagtgtcag gggacctagc agcacagtgg 900 
gcccctccct caactggtac gaggctagtg acctgatgga gctggtgggg caggagccag 960
aatccgccct gccagacagc agcctgagcc ccgcctgctc ggagctgatt ggagccctgg 1020 
acagccaccc gctgtcccgc ctgctctgga gacgcctgaa gcctctgatc ctcgggaagc 1080 
tactctttgc accagataca ccttttaccc ggaagctcat ggcccaggtg aaccggacct 1140 
tcgaggagct caccctgctg agggatgtcc gggaggtgtg ggagatgctg ggaccccgga 1200 
tcttcacctt catgaacgac agttccaatg tggccatgct gcagcggctc ctgcagatgc 1260 
aggatgaagg aagaaggcag cccagacctg gaggccggga ccacatggag gccctgcgat 1320 
cctttctgga ccctgggagc ggtggctaca gctggcagga cgcacacgct gatgtggggc 1380 
acctggtggg cacgctgggc cgagtgacgg agtgdctgtc cttggacaag ctggaggcgg 1440 
caccctcaga ggcagccctg gtgtcgcggg ccctgcaact gctcgcggaa catcgattct 1500 
gggccggcgt cgtcttcttg ggacctgagg actcttcaga ccccacagag cacccaaccc 1560
cagacctggg ccccggccac gtgcgcatca aaatccgcat ggacattgac gtggtcacga 1620 
ggaccaataa gatcagggac aggttttggg accctggccc agccgcggac cccctgaccg 1680 
acctgcgcta cgtgtggggc ggcttcgtgt acctgcaaga cctggtcrgag cgtgcagccg 1740 
tccgcgtgct cagcggcgcc aacccccggg ccggcctcta cctgcagcag atgccctatc 1800 
cgtgctatgt ggacgacgtg ttcctgcgtg tgctgagccg gtcgctgccg ctcttcctga 1860 
cgctggcctg gatctactcc gtgacactga cagtgaaggc cgtggtgcgg gagaaggaga 1920 
cgcggctgcg ggacaccatg cgcgccatgg ggctcagccg cgcggtgctc tggctaggct 1980 
ggttcctcag ctgcctcggg cccttcctgc tcagcgccgc gctgctggtt ctggtgctca 2040 
agctggggga catcctcccc tacagccacc cgggcgtggt attcctgttc ttggcagcct 2100 
tcgcggtggc cacggtgacc cagagcttcc tgctcagcgc cttcttctcc cgcgccaacc 2160 
tggctgcggc ctgcggcggc ctggcctact tctccctcta cctgccctac gtgctgtgtg 2220 
tggcttggcg ggaccggctg cccgcgggtg gccgcgtggc cgcgagcctg ctgtcgcccg 2280 
tggccttegg cttcggctgc gagagcctgg ctctgctgga ggagcagggc gagggcgcgc 2340 
agtggcacaa cgtgggcacc cggcctacgg cagacgtctt cagcctggcc cagg1;ctctg 2400 
gccttctgct gctggacgcg gcgctctacg gcctcgccac ctggtacctg gaagctgtgt 2460 
gcccaggcca gtacgggatc cctgaaccat ggaattttcc ttttcggagg agctactggt 2520 
gcggacctcg gccccccaag agtccagccc cttgccccac cccgctggac ccaaaggtgc 2580 
tggtagaaga ggcaccgccc ggcctgagtc ctggcgtctc cgttcgcagc ctggagaagc 2640 
gctttcctgg aagcccgcag ccagccctgc gggggctcag cctggacttc taccagggcc 2700 
acatcaccgc cttcctgggc cacaacgggg ccggcaagac caccaccctg tccatcttga 2760 
gtggcctctt cccacccagt ggtggctctg ccttcatcct gggccacgac gtccgctcca 2820 
gcatggccgc catccggccc cacctgggcg tctgtcctca gtacaacgtg ctgtttgaca 2880 
tgctgaccgt ggacgagcac gtctggttct atgggcggct gaagggtctg agtgccgctg 2940 
tagtgggccc cgagcaggac cgtctgctgc aggatgtggg gctggtctcc aagcagagtg 3000
tgcagactcg ccacctctct ggtgggatgc aacggaagct gtccgtggcc attgcctttg 3060
tgggcggctc ccaagttgtt atcctggacg agcctacggc tggcgtggat cctgcttccc 3120
gccgcggtat ttgggagctg ctgctcaaat accgagaagg tcgcacgctg atcctctcca 3180 
cccaccacqt ggatgaggca gagctgctgg gagaccgtgt ggccgtggtg gcaggtggcc 3240 
gcttgtgctg ctgtggatcc ccactcttcc tgcgccgtca cctgggctcc ggctactacc 3300 




tgacgctggt gaaggcccgc ctgcccctga ccaccaatga gaaggctgac actgacatgg 3360 
agggcagtgt ggacaccagg caggaaaaga agaatggcag ccagggcagc agagtcggca 3420 
ctcctcagct gctggccctg gtacagcact gggtgcccg^ ggcacggctg gtggaggagc 3480 
tgccacacga gctggftgctg gtgctgccct acacgggtgc ccatgacggc agcttcgcca 3540 
cactcttccg agagctagac acgcggctgg cggagctgag gictcactggc tacgggatct 3600
ccgacaccag cctcgaggag atcttcctga aggtggtgga ggagtgtgct gcggacacag 3660
atatggagga tggcagctgc gggcagcacc tatgcacagg cattgctggc ctagacgtaa 3720
ccctacggct caagatgccg ccacaggaga cagcgctgga gaacg&ggaa ccagctgggt 3780 
cagccccaga gactgaccag ggctctgggc cagacgccgt ^ggccgggta cagggctggg 3840 
cactgacccg ccagcagctc caggccctgc ttctcaagcg ctttctgctt gcccgccgca 3900 
gccgccgcgg cctgttcgcc cagatcgtgc tgcctgccct ctttgtgggc ctggccctcg 3960 
tgttcagcct catcgtgcct cctttcgggc acti^icccggc tctgcggctc agtcccacca 4020 
tgtacggtgc tcaggtgtcc ttcttcagtg aggacgcccc aggggaccct ggacgtgccc 4080 
ggctgctcga ggcgctgctg caggaggcag gactggagga gccccaagtg cagcatagct 4140 
cccacaggtt ctcggcacca gaagttcctg ctgaagtggc caaggtcttg gccagtggca 4200
actggacccc agagtctcca tccccagcct gccagtgtag ccggcccggt gcccggcgcc 4260
tgctgcccga ctgcccggct gcagctggtg gtccccctcc gccccaggca gt^accggct 4320
ctggggaagt ggttcagaac cagacaggcc ggaacctgtc tgacttcctg gtcaagacct 4380
pcccgcgcct ggtgcgccag ggcctgaaga ctaagaagtg ggtgaatgag gtcagatacg 4440 
gaggcttcte gctggggggc cgagaoccag gcctgccctc gggccaagag ttgggccgct 4500 
cagtggagga gttgtgggcg ctgctgagtc ccctgcctgg cggggccctc gaccgtgtcc 4560 
tgaaaaacct cacagcctgg gctcacagcc tggatgctca ggacagtctc aagatctggt 4620 
tcaacaacaa aggctggcac tccatggtgg cctttgtcaa ccgagccagc aacgcaatcc 4680 
tccgtgctca cctgccccca ggcccggccc gccacgccca cagcatcacc acactcaacc 4740
accccttgaa cctcaccaag gagcagctgt ctgaggctgc actgatggcc tcctcggtgg 4800
acgtcctcgt ctccatctgt gtggtctttg ccatgtcctt tgtcccggcc agcttcactc 4860
ttgtcctcat tgaggagcga gtcacccgag ccaagc^cct gcagctqatg gggggcctgt 4920
cccccaccct ctactggctt ggcaactttc tctgggacat gtgtaactac ttggtgccag 4980 
catgcatcgt ggtgctcatc tttctggcct tccagcagag ggcatatgtg gcccctgcca 5040 
acctgcctgc tctcctgctg ttgctactac tgtatggctg gtcgatcaca ccgctcatgt 5100 
acccagcctc cttcttcttc tccgtgccca gcacagccta tgtggtgctc acctgcataa 5160 
acctctttat tggcatcaat ggaagcatgg ccacctttgt gcttgagctc ttctctgatc 5220 
agaagctgca ggaggtgagc cggatcttga aacaggtctt ccttatcttc ccccacttct 5280 
gcttgggccg ggggctcatt gacatggtgc ggaaccaggc catggctgat gcctttgagc 5340
gcttgggaga caggcagttc cagtcacccc tgcgctggga ggtggtcgrgc aagaacctct 5400 
tggccatggt gatacagggg cccctcttcc ttctcttcac actactgctg cagcaccgaa 5460
gccaactcct gccacagccc agggtgaggt ctctgccact cctgggagag gaggacgagg 5520 
atgtagcccg tgaacgggag cgggtggtcc aaggagccac ccagggggat gtgttggtgc 5580 
tgaggaactt gaceaaggta taccgtgggc agaggatgcc agctgttgac cgcttgtgcc 5640 
tggggattcc ccctggtgag tgttttgggc tgctgggtgt gaatggagca gggaagacgt 5700 
ccacgtttcg catggtgacg ggggacacat tggccagcag gggcgaggct gtgctggcag 5760
gccacagcgt ggcccgggaa cccagtgctg cgcacctcag catgggatac tgccctcaat 5820 
ccgatgccat ctttgagctg ctgacgggcc gcgagcacct ggagctgctt gcgcgcctgc 5880 
gc^gtgtccc ggaggcccag gttgcccaga ccgctggctc gggcctggcg cgtctgggac 5940 
tctcatggta cgcagaccgg cctgcaggca cctacagcgg agggaacaaa cgcaagctgg 6000 
cgacggccct ggcgctggtt ggggacccag ccgtggtgtt tctggacgag ccgaccacag 6060 
gcatggaccc cagcgcgcgg cgcttccttt ggaacagcct ti:ttggccgtg gtgcgggagg 6120 
gccgttcagt gatgctcacc tcccatagca tggaggagtg tgaagcgctc tgctcgcgcc 6180 
tagccatcat ggtgaatggg cggttccgct gcctgggcag cccgcaacat ctcaagggca 6240 
gattcgcggc gggtcacaca ctgaccctgc gggtgcccgc cgcaaggtcc cagccggcag 6300
cggccttcgt ggcggccgag ttccctgggg cggagctgcg cgaggcacat ggaggccgcc 6360
tgcgcttcca gctgccgceg ggagggcgct gcgccctggc gcgcgtcttt ggagagctgg 6420 
cggtgcacgg cgcagagcac ggcgtggagg acttttccgt gagccagacg atgctggagg 6480 
aggtattctt gtacttctcc aaggaccagg ggaaggacga ggacaccgaa gagcagaagg 6540 
aggcaggagt gggagtggac cccgcgccag gcctgcagca ccccaaacgc gtcagccagt 6600 
tcctcgatga ccctagcact gccgagactg tgctctgagc ctccctcccc tgcggggccg 6660 
cggggaggcc ctgggaatgg caagggcaag gtagagtgcc taggagccct ggactcaggc 6720 
tggcagaggg gctggtgrccc tggagaaaat aaagagaagg ctggagagaa gccgtggtgg 6780 
tgaastaaaaa a 6791

<210> 60 
<211> 5214 

<212> DNA
<213> Homo sapiens
<220>

<221> misc_feature 
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<400> 60 

atgctcaaaa ggaagcagag ttccagggtg gaagcccagc cagtcactga ctttggtcct 60
gatgagtctc tgt.cggata^ tgctgacatc ctctggatta acaaaccatg ggttcactct 120
ttgctgcgca tctgtgccat catcagcgtc atttctgttt ^tatgaatac gccaatgacc 180
ttcgagcact ^tcctccact tcagtatgtg acCttcactt tggatacatt attgatgttt 240
ctctacacgg cagagatgat agcaaaaatgr cacatccggg gcattgtcaa gggggatagt 300
tcctatgtga aagatcgctg gtgtgttttt gatggattta tggtcttttg cctttgggtt 360
tGtttggtgc tacaggtgtt tgaaattgct gatpitagttg atcagatgtc accttggggc 420
atgttgcgga ttccacggcc actgattatg atccgagcat tccggattta tttccgattt 480
gaactgccaa ggaccagaat tacaaatatt ttaaagcgat cgggagaaca aatatggagt 540
gtttccattt ttctactttt qtttctactt ctttatgga^ ttttaggagt tcagatgttt 600
ggaacattta cttatcactg tgttptaaat gacacaaagc cagggaatgt aacctggaat 660
agtttagcta ttccagacac acactgctca ccagagctag aagaaggcta ccagtgccca 720
cctggattta aatgqatgga ccttgaagat ctgggactta gcaggcaaga gctgggctac 780
agtggcttta atgagatagg aactagtata ttcaccgtct atgaggccgc ctcacaggaa 840
ggctgggtgt tcctcatgta cagagcaatt gacagctttc cccgttggcg ttcctacttc 900
tatttcatca ctctcatttt cttcctcgcc tggcttgtga agaacgtgtt tattgctgtt 960
atcattgaaa catttgcaga aatcagagta cagtttcaac aaatgtgggg atcgagaagc 1020
agcactacct caacagccac cacccagatg tttcatgaag atgctgctgg aggttggcag 1080
ctggtagctg tggatgtcaa caagccccag ggaicgcgccc cagcctgcct ccagaaaatg 1140
atgeggtcat ccgttttcca catgttcatc ctgagcatgg tgaccgtgga cgtgatcgtg 1200
gcggctagca actactacaa aggagaaaac ttcaggaggc agtacgacga gttctacctg 1260
gcggaggtgg cttttacagt actttttgat ttggaagcac ttctgaagat atggtgtttg 1320
ggatttactg gatatattag ctcatctctc cacaaattcg aactactact cgtaattgga 1380
actactcttc atgtataccc agatcitttat cattcacaat tcacgtactt tcaggtactc 1440
cgagtagttc ggctgattaa gatttcacct gcattagaag actttgtgta caagatajttt 1500
ggtcctggaa aa^agcttgg gagtttggtt gtatttactg ccagcctctt gattgttatg 1560
tcagcaatta gtttgcagat gttctgcttt gtcgaagaac tggacag^tt tactacgttt 1620
ccgagggcat ttatgtccat gttccagatc ctcacccagg aaggatgggt ggacgtaatg 1680
gaccaaactc taaatgctgt gggacatatg tgggcacGcg tggttgccat ctatttcatt 1740
ctctatcatc tttttgccac tctgatcctc ctgagtttgt ttgttgctgt tattttggac 1800
aacttagaac ttgatgaaga cctaaagaag cttaaacaat taaagcaaag tgaagcstaat 1860
gcggacacca aagaaaagct ccctttacgc ctgcgaatct ttgaaaaatt tccaaacaga 1920
cctcaaatgg tgaaaatctc aaagcttcct tcagatttta cagttcctaa aatcagggag 1980
agttttatga agcagtttat tgaccgccag caacaggaca catgttgcct tctga^aagc 2040
ctcccgacca cctcttcctc ctcctgcgac cactccaaac gctcagcaat tgaggacaac 2100
aaatacatcg accaaaaact tcgcaagtct gttttcagca tcagggcaag gaaccttctg 2160
gaaaaggaga ccgcagtcac taaaatctta agggcttgca cccgacagcg catgctgagc 2220
ggatcatttg aggggcagcc cgcaaaggag aggtcaatcc tcagcgtgca gcatcatatc 2280
cgccaagagc gcaggtcact aagacatgga tcaaacagcc agaggatcag caggggaaaa 2340
tctcttgaaa ctttgactca agatcattgc aatacagtga tatatagaaa tgctcaaaga 2400
gaagtcagtg aaataaagat gattcaggaa aaaaaggagc ta^cagagat gcttcaagga 2460
aagtgcaaaa aggaactcag agagagccap ccatacttcg ataagccact gttcattgtc 2520
gggcgagaac acaggttcag aaacttttgc cgggtggtgg tccgagcacg cttcaacgcg 2580
tctaaaacag accctgtcac aggagctgtg aaaaatacaa agtaccatct tctttatgat 2640
ttgctgggat tggtcactta cctggactgg gtcatgatca tcgtaacctc tgactcttgc 2700
atttccatga tgtttgagtc cccgtttcga agagtcatgc atgcacctac tttgcagatt 2760
gctgagtatg tgtttgtgat attcatgagc attgagctta atctgaagat tatggcagat 2820
ggcttatttt tcactccaac tgctgtcatc agggacttcg gtggagtaat ggacatattt 2880
atatatcttg tgagcttgat atttctttgt tggatgcctc aaaatgtacc tgctgaatcg 2940
ggagctcagc ttctaatggt ccttcggtgc ctgagacctc tgcgcatatt caaactggtg 3000
ccccagatga ggaaagttgt tcgagaactt ttcagcggct tcaaggaaat ttttttggtc 3060
tccattcttt tgctgacatt aatgctcgtt tttgcaagct ttggagttca gctttttgct 3120
ggaaaactgg ccaagtgcaa tgatcccaac attattagaa gggaagattg caatggcata 3180
ttcagaatta atgtcagtgt gtcaaagaac ttaaatttaa aattgaggcc tggagagaaa 3240
aaacctggat tttgggtgcc ccgtgtttgg gcgaatcctc ggaactttaa tttcgacaat 3300
gtgggaaacg ctatgctggc gttgtttgaa gttctctcct tgaaaggctg ggtggaagtg 3360
agagatgtta ttattcatcg tgtggggccg atccatggaa tctatattca tgtttttgta 3420
ttcctgggtt gcatgattgg actgaccctt tttgttggag tagttattgc taatttcaat 3480
gaaaacaagg ggacggcttt gctgaccgtc gatcagagaa gatgggaaga cctgaagagc 3540
cgactgaaga tcgcacagcc tcttcatctt ccgcctcgcc cggataatga tggttttaga 3600
gctaaaatgt atgacataac ccagcatcca ttttttaaga ggacaatcgc attactcgtc 3660
ctggcccagt cggtgttgct ctctgtcaag tgggacgtcg aggacccggt gaccgtacct 3720
ttggcaacaa tgtcagttgt tttcaccttc atctttgttc tggaggtaac catgaagatc 3780
atagcaatgt cgcctgctgg cttctggcaa agcagaagaa accgatacga tctcctggtg 3840
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acgtcgcttg gcgttgtatg ggtggtgctt cactttgccc tcctgaatgc atatacttac 3900 

atgatgggcg cttgtgtgat tgtatttagg tttttctcca tctgtggaaa acatgtaacg 3960

ctaaagatgc tcctcttgac agtggtcgtc agcatgtaca agagcttctt tatcatagta 4020 

ggcatgtttc tcttgctgct gtgttacgct tttgctggag ttgttttatt tggtactgtg 4080

aaatatgggg agaatattaa caggcatgc^ aatttttct;t^ cggctggaaa agctattacc 4140 

gtactgttcc gaattgtcac aggtgaagac tggaacaaga ttatgcatga ctgtatggta 4200
cagcctcicgt tttgtactcc agatgaattt acatactggg caacagactg tggaaattat 4260

gctggggcac ttatgbattt ctgttcattt tat^tcatca ttgcctacat catgctaaat 4320 

ctgcttgtag ccataattgt ggagaatttc tccttgattt attccactga ggaggaccag 4380 

cttttaagtt acaatgatct tcgccacttt caaataatat ggaacatggt ggatgataaa 4440 

agagaggtat tccecacgtt ccgcgtcaag ttcctgctgc ggctactgcg tgggaggctg 4500 

gaggtggacc tggacaagga caagctcctg tttaagcaca tgtgctacga aatggagagg 4560

ctccacaatg gcggcgacgt caccttccat gatgtcctga gcatgctttc ataccggtcc 4620 

gtggacatcc ggaagagctt gcagctggag gaactcctgg cgagggagca gctggagtac 4680

accatagagg aggaggtggc caagcagacc atccgcatgt ggctcaagaa gtgcctgaag 4740

cgcatcagag ctaaacagca gcagtcgtgc agtatcatcc acagcctgag agagagtcag 4800

cagcaagagc tgagccggtt tctgaacccg cccagcatcg agaccaccca gcccagtgag 4860 

gacacgaatg ccaacagtca ggadaacagc atgcaacctg. aigacaagca'g ccagcagcag 4920 

ctcctgagcc ccacgctgtc ggatcgagga ggaagtcggc aagatgcagc cgacgcaggg 4980 

ka^ccccaga ggaaatttgg gcagtggcgt ctgccctcag ccccaaaacc aataagccat 5040 

tcagtgtcct cagtcaactt acggtttgga ggaaggacaa ccatgaaatc tgtcgtgtgc 5100 

aaaatgaacc ccatgactga cgcggcttcc tgcggttctg aagttaagaa gtggtggacc 5160

cggcagctga ctgtggagag cgacgaaagt ggggatgacc ttctggatat ttag 5214

<210> 61 
<211> 1818 
<212> DNA 

<213> Homo sapiens
<220>
<221> misc_feature

<223> Incyte ID No: 168827CB1 

<400> 61 

ggaaattgct tccgtgaccc tgctgcagat gggagagagg gcccattaag aagagagtgg 60 
ggtcaggatc aacacacaca cttagtgtga tttaaggaaa ggaaatattt tctctttgaa 120
cttatctgga tacagtcatt ttgtctcctc ttggggatca cttgtccagc ctcaatggcc 180 
tttcaggacc tcctagatca agttggaggc ctggggagat tccagatcct tcagatggtt 240 
ttccttataa tgttcaacgt catagtatac catcaaactc agctggagaa cttcgcagca 300
ttcatacttg atcatcgctg ctgggttcat atactggaca atgacactat ccctgacaat 360 
gaccctggga ccctcagcca ggatgccctc ctgagaatct ccatcccatt cgactcaaat 420
ctgaggccag agaagtgtcg tcgctttgtc catccccagt ggaagctcat tcatctgaat 480 
gggaccttcc ccaacacgag tgagccagat acagagccct gtgtggatgg ctgggtatat 540
gaccaaagct ccttcccttc caccattgtg actaagtggg atctggtatg cgaatctcaa 600 
ccactgaatt cagtagctaa atttctattc atggctggaa tgatggtggg aggcaaccta 660 
tatggccatt tgtcagacag gtttgggaga aagttcgtgc tcagatggtc ttacctccag 720 
ctcgccattg taggcacctg tgcggccttt gctcccacca t^ctcgtata ctgctccctg 780 
pgcttcttgg ctggggctgc tacatttagc atcattgtaa atactgtttt gttaattgta 840 
gagtggataa ctcaccaatt ctgtgccatg gcattgacat tgacactttg tgctgctagt 900
attggacata taaccctggg aagcctggct tttgtcattc gagaccagtg catcctccag 960
ttggtgatgt ctgcaccatg ctttgtcttc tttctgttct caaggtggct ggcagagtct 1020 
gctcggtggc tcattatcaa caacaaacca gaagagggct taaaggaact taca^aagct 1080 
gcacacagga atggaatgaa gaatgctgaa gacatcctaa ccatggaggt tttgaaatcc 1140 
accatgaagc aagaactgga ggcagcacag aaaaagcatt ctctttgtga attgctccgc 1200 
atacccaaca tatgtaaaag aatctgtttc ctgtcctttg tgagatttgc aagtaccatc 1260 
cctttttggg gccttacttt gcacctccag catctgggaa acaatgtttt cctgttgcag 1320 
actctctttg gtgcagtcac cctcctggcc aattgtgttg caccttgggc actgaatcac 1380
atgagccgtc gactaagcca gatgcttctc atgttcctac tggcaacctg ccttctggcc 1440 
atcatatttg tgcctcaaga aatgcagacc ctgcgtgtgg ttttggcaac cctgggtgtg 1500 
ggagctgctt ctcttggcat tacctgttct actgcccaag aaaatgaact aattccttcc 1560 
ataatcaggg gaagagctac tggaatcact ggaaactttg ctaatattgg gggagccctg 1620 
gcttccctca tgatgatcct aagcatatat tctcgacccc tgccctggat catctatgga 1680 
gtctttgcca tcctctctgg ccttgttgtc ctcctccttc ctgaaaccag gaaccagcct 1740 
cttcttgaca gcatccagga tgtggaaaat gagggagtaa atagcctagc tgcccctcag 1800 
aggagctctg tgctatag 1818 

<210> 62
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<211> 2245 
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 7472734CB1

<400> 62 

cccttggaac agtaggatgt tggtgatgca aaagtcaatg tttaaactca acattccact 60
ttcctttaac taagaatagt ttttattaac ttttagtaaa actcagtcct agtccaaaaa 120 
aagccctgct ctctgatctt tgtacaagaa catcataaag caattcactt tggattttct 180
aatatcccat ttctgagaag ^atggcagac tattgaacag gtgtatttta ggtcacgtgg 240 
ggactgcatc cacctgaaaa tcc^ccgttg ^cttatcagg aaactcagag atcaggatct 300
ttcacagagt agtcttttaa gaagattcag ttgteaacag ctagcagtct ctttgccaaa 360 
taattatatc tgtgacttct gaaactattt ggctgcctaa agttaaagga cttggggaaa 420 
gtccctccac tgctcttctg cagtagtgtc acaccactca gtgcagggqc caccaagaag 480 
aaagcagtgt 'caggatccac atggcactat ggtaactttg tgaaagggga cattttctcc 540 
ctctgaactt ctcttc^tea agtcatLtgtg cttcctcttg gggatcacct gttcagtctc 600 
aatgggcttt gatgtgctcc tggatcaagt gggtggcatg gggagattcc agatttgtct 660 
gatagctttc ttttgcatca ccaacatcct actgttccct aatattgtgt tggagaactt 720
cactgcattc acccctagtc atcgctgctg ggtccccctc ctggacaatg acactgtgtc 780
tgacaatgat accgggaccc tcagcaagga tgacctcctg agaatctcca tcccactgga 840 
ctcaaacqtg aggccacaga agtgtcagcg ctttatccat ccccagtggc agctccttca 900 
cctgaacggg accttcccca acacaaatga gccagacacg gagccctgtg tggatggctg 960 
ggtgtacgac agaagctctt tcctctccac catcgtgact gagtgggacc tggtatgtga 1020 
atctcagtca ctaaaatcaa tggtticaatc cctatttatg gctgggtcac ttctgggagg 1080 
tctaatatat ggccatcttt cagacaggtt tgggagaaag ttcgtgctca gatggt9tta 1140 
cctccagctc gc9attgtag gcacctgtgc ggcctttgct cccaccatcc tcgtatactg 1200 
ctccctgcgc ttcttggctg gggctgctac atttagcatc attgtaa^ta ctgttttgtt 1260 
aattgtagag tggataactc accaattctg tgccatggca ttgacattga cactttgtgc 1320 
tgctagtatt ggacatataa ccctgggaag cctggctttt gtcattcgag accagtgcat 1380
cqtccagttg gtgatgtctg caccatgctt tgtcttcttt ctgttctcaa ggtggctggc 1440 
agagtctgct cggtggctca ttatcaacaa caaaccagaa gagggcttaa aggaacttag 1500 
aaaagctgca cacaggaatg gaatgaagaa tgctgaagac atcctaacca tggaggtttt 1560 
gaaatccacc atgaagcaag aactggaggc agcacagaaa aagcattctc tttgtgaatt 1620 
gctccgcata cccaacatat gtaaaagaat ctgfcttcctg tcotttgtga gatttgcaag 1680 
taccatccct ttttggggcc ttactttgca cctccagcat ctgggaaaca atgttttcct 1740 
gttgcagact ctctttggtg cagtcaccct cctggccaat tgtgttgcac cttgggcact 1800 
gaatcacatg agccgtcgac taagccagat gcttctcatg ttcctactgg caacctgcct 1860 
tctggccatc atatttgtgc ctcaagaaat gcagaccctg cgtgtggttt tggcaaccct 1920 
gggtgtggga gctgcttctc ttggcattac ctgttctact gcccaagaaa atgaactaat 1980 
tccttccata atcaggggaa gagctactgg aatcactgga aactttgcta atattggggg 2040 
agccctggct tccetcatga tgatcctaag catatattct cgacccctgc cctggatcat 2100 
ctatggagtc tttgccatcc tctctggcct tgttgtcctc ctccttcctg aaaccaggaa 2160 
ccagcctctt cttgacagca tccaggatgt ggaaaatgag ggagtaaata gcctagctgc 2220 
ccctcagagg agctctgtgc tatag 2245 

<210> 63 

<211> 3196 

<212> DNA 

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 7473473CB1

<400> 63 

gcggcggccg ggggagcgct actaccatga actgcctggt cctcctcccc agagctgctc 60 

atccgggtcg ggctggagac acagtcaggg gaccccgtcg ccgccgccgc gccccctctt 120 

ctttcggctc aatcttctct tccacctttt cctcctcttc ctccaccttc tttgcctgc^ 180 

tccccccctc ccccgccgcg gatcctggcc gctgctctcc agacccagga tgccgggggg 240 

caagagaggg ctggtggcac cgcagaacac atttttggag aacatcgtca ggcgctccag 300 

tgaatcaagt ttcttactgg gaaatgccca gattgtggat tggcctgtag tttatagtaa 360

tgacggtttt tgtaaactct ctggatatca tcgagctgac gtcatgcaga aaagcagcac 420 

ttgcagtttt atgtatgggg aattgactga caagaagacc attgagaaag tcaggcaaac 480 

ttttgacaac tacgaatcaa actgctttga agttcttctg tacaagaaaa acagaacccc 540 
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tgtttggttt tatatgcaaa ttgcaccaat aagaaatgaa catgaaaagg tggtcttgtt 600
cctgtgtact ttcaaggata ttacgttgtt caaacagcca atagaggatg attcaacaaa 660
aggttggacg aaatttgocc gStt^&cacjg ggcttt^aca aatagccgab gtgttttgca 720
gcagctcacg ccaatgaata aaacagaggt ggtccataaa cattcaagac tagatgaagt 780
tcttcagctg ggatcagata tcc^tcctca gtataaacaa gaagcgccaa agacgccacc 840
acacattatt ttacattatt gtgcttttaa aactacttgg gattgggtga ttttaattct 900
taccttctac accgccatta tggttcctta taatgtttcc ttcaa&acaa agcagaacaa 960
catagcctgg ctggtactgg atagtgtggt ggacgttatt tttctggttg acatcgtttt 1020
aaattttcac acgactttcg tggggcccgg tggagaggtc atttctgacc ctaagctcat 1080
aaggatgaac tatctgaaaa cttggtttgt gatcgatctg ctgtcttgtt taccttatga 1140
catcatcaat gcctttgaaa atgtggatga gggaatcagc agtctcttca gttctttaaa 1200
agtggtgcgt ctcttacgac tgggccgtgt ggctaggaaa ct^gaccatt acctagaata 1260
tggagcagca gtectcgtgc tcctggtgtg tgtgtttgga ctggtggccc actggctggc 1320
ctgcatatgg tatagcatcg gagactacga ^gtcattgat gaagtcacta acaccatcca 1380
aatagacagt tggctctacc agctggcttt gagcattggg actccatatc gctacaatac 1440
cagtgctggg atatgggaag gaggacccag caaggattca ttgtacgtgt cctctctcta 1500
ctttaccatg acaagcctta caaccatagg atttggaaac atagctccta ccacagatgt 1560
ggagaagatg ttttcggtgg ctatg^tgat ggttggcgct cttctttatg caactatttt 1620
tggaaatgtt acaacaatitt tccagcaaat gtatgccaac accaaccgat accatgagat 1680
gctgaataat gtacgggact tcctaaaact ctatcaggtc ticaaaaggcc ttagtgagcg 1740
agtcatggat tatattgtct caacatggtc catgtcaaaa ggcattgata cagaaaaggt 1800
cctctccatc tgtcccaagg acatgagagc tgatatctgt gttcatctaa accggaaggt 1860
ttttaatgaa catcctgctt ttcgattggc cagcgatggg tgtctgcgcg ccttggcggt 1920
agagttccaa accattcact gtgctcccgg ggacctcatt taccatgctig gagaaagtgt 1980
ggatgccctc tgctttgtgg tgtcaggatc cttggaagtc atccaggatg atgaggtggt 2040
ggctatttta gggaagggtg atgtatttgg agacatcttc tggaaggaaa ccacccttgc 2100
ccatgcatgt gcgaacgtcc gggc^ctgac gtactgtgac ctacacatca tcaagcggga 2160
agccttgctc aaagtcctgg acttttatac agcttttgca aactcctftct caagga^tct 2220
cactcttact tgcaatct^a ggaaacggat catctttcgt aagatcagtg atgtgaagaa 2280
agaggaggag gagcgcctcc ggcagaagaa tgaggtgacc ctcagcattc ccgtggacca 2340
cccagtcaga aagctcttcc agaagttcaa gcagcagaag gagctgcgga atcagggctc 2400
aacacagggt gacGctgaga ggaaccaact ccaggtagag agccgctcct tacagaatgg 2460
agcctccatc accggaacca gcgtggtgac tgtgtcacag attactccca ttcagacgtc 2520
tctggcctat gtgaaaacca gtgaatccct taagcagaac aaccgtgatg ccatggaact 2580
caagcccaac ggcggtgctg accaaaaatg tctcaaagtc aacagcccaa taagaatgaa 2640
gaatggaaat ggaaaagggt ggctgcgact caagaataat atgggagccc atgaggagaa 2700
aaaggaagac tggaataatg tcactaaagc tga^tcaatg gggctattgt ctgaggaccc 2760
caagagcagt gattcagaga acagtgtgac caaaaaccca ctaagaaaaa cagattcttg 2820
tgacagtgga attacaaaaa gtgaccttcg tttggataag gctggggagg cccgaagtcc 2880
gctagagcac agtcccatcc aggctgatgc caagcacccc ttttatccca tccccgagca 2940
ggccttacag accacactgc aggaagtcaa acacgaactc aaagaggaca tccagctgct 3000
cagctgcaga atgactgccc tagaaaagca ggtggcagaa attttaaaaa tactgtcgga 3060
aaaaagcgta ccccaggcct catctcccaa atcccaaatg ccactccaag taccccccca 3120
gataccatgt caggatattt ttagtgtctc aaggcctgaa tcacctgaat ctgacaaaga 3180
tgaaatccac ttttaa 3196



<210> 64 

<211> 1602 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7477725CB1



<400> 64 

atggcctttg aggagctctt gagtcaagtt ggaggccttg ggagatttca gatgcttcat 60
ctggttttta ttcttccctc tctcatgtta ttaatccctc atatactgct agagaac 1 1 1 120
gctgcagcca ttcctggtca tcgttgctgg gtccacatgc tggacaataa tactggatct 180
ggtaatgaaa ctggaatcct cagtgaagat gccctcttga gaatctctat cccactagac 240
tcaaatctga ggccagagaa gtgtcgtcgc tttgtccatc cccagtggca gcttcttcac 300
ctgaatggga ctatccacag cacaagtgag gcagacacag aaccctgtgt ggatggctgg 360
gtatatgatc aaagctactt cccttcgacc attgtgacta agtgggacct ggtatgtgat 420
tatcagtcac tgaaatcagt ggttcaattc ctacttctga ctggaatgct ggtgggaggc 480
atcataggtg gccatgtctc agacaggttt 999cgaagat ttattctcag atggtgtttg 540
ctccagcttg ccattactga cacctgcgct gccttcgctc ccaccttccc tgtttactgt 600
gtactacgct tcttggcagg tttttcttcc atgatcatta tatcaaataa ttctttgccc 660
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attactgagt ggataaggcc caactctaaa gccctggtag taatattgtc atctggtgcc 720
cttagtattg gacagataat cctgggaggc ttggcttatg tcttccgaga ctggcaaacG 780
ctgcacgtgg tggcgtctgt acctttcttt gtcttctttc ttctttcaag gtggctggtg 840
gaatctgctc ggtggttgat aatcaccaat aaactagatg agggcttaaa ggcacttaga 900
aaagttgcac gcacaaatgg aataaagaat gctgaagaaa ccctgaacat agaggttgta 960
agatccacca tgcaggagga gctggatgca gcacagacca aaactalctgt gtgtgacttg 1020
ttqcgcdacc ccagtatgcg taaaaggatc tgtatcctgg tatttttgag atttgcaaac 1080
acaatacctt tttatggtac catggtcaat cttdagcatg t^gggagcaa cattttcctg 1140
ttgcaggtac tttatggagc tgtcgctctc atagttcgat gtcttgctct tttgacacta 1200
aatcatatgg gccgtcgaat aagccagata ttgttcatgt tcctggtggg cctttccatt 1260
ttggccaaca cgtttgtgcc caaagaaatg cagaccctgc gtgtggcttt ggcatgtctg 1320
ggaatcggct gttctgctgc tactttttcc agtgttgctg ttcacttcat tgaactcatc 1380
cccactgttc tcagggcaag aigcttcagga atagatttaa cggetagtdg gattggagca 1440
gcactggctc ccctcttgat gaccttaacg gtatttttjta ccactttgcc atggatcatt 1500
tatggaatct tccccatcat tggtggcctt attgtcttcc tcctaccaga aaccaagaat 1560
ctgcctttgc ctgacaccat caaggatgtg gaaaatcagt ga 1602
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